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1 . A process of preparing unsaturated fatty acids, which comprises introducing, into 
an organism, at least one isolated nucleic acid sequence encoding a polypeptide 
having A6-desaturase activity k selected from the group consisting of: 

a) a nucleic acid sequence having the sequence shown in SEQ ID NO: 1 , 

b) nucleic acid sequences which, as a result of the degeneracy of the 
genetic code, are derived from the [lacuna] in SEQ ID NO: 1, 

c) derivatives of the nucleic acid sequence shown in SEQ ID NO: 1 which 
encode polypeptides with the amino acid sequences shown in SEQ ID 
NO: 2 and have at least 50% homology at the amino acid level without 
substantially reducing the enzymatic action of the polypeptides, 

and culturing this organism, where the cultured organism contains at least 1 mol% of 
unsaturated fatty acids based on the total fatty acid content in the organism. 

2. The process as claimed in claim 1 , wherein the nucleic acid sequence is derived 
from a plant or algae. 

3. The process as claimed in claim 1 , wherein the nucleic acid sequence is derived 
from Physcomitrella patens. 

4. The process as claimed in claim 1 , wherein the organism is an organism 
selected from the group consisting of bacterium, fungus, ciliate, algae, 
cyanobacterium, animal or plant. 

5. The process as claimed in claim 1 , wherein the organism is a plant or algae. 



The process as claimed in claim 1 , wherein the organism is an oil crops [sic]. 
The process as claimed in claim 1, wherein the cultured organism contains at 
least 5% by weight of unsaturated fatty acids based on the total fatty acid 
content in the organism. 

The process as claimed in claim 1, wherein the unsaturated fatty acids are 
isolated from the organism. • - 
A transgenic organism selected from the group consisting of plants, fungi, 
ciliates, algae, bacteria, cyanobacteria or animals comprising at least one 
isolated nucleic acid sequence encoding a polypeptide with A6-desaturase 
activity, selected from the group consisting of: 

a) a nucleic acid sequence having the sequence shown in SEQ ID NO: 1 , 

b) nucleic acid sequences^which, as a result of the degeneracy of the 
genetic code, are derived from the [lacuna) in SEQ ID NO: 1 , 

c) derivatives of the nucleic acid sequence shown in SEQ ID NO: 1 which 
encode polypeptides with the amino acid sequences shown in SEQ ID 
NO: 2 and have at least 50% homology at the amino acid level without 
substantially reducing the enzymatic action of the polypeptides. 

A transgenic organism as claimed in claim 9, wherein the organism is a plant or 
algae. 

An oil, lipid or fatty acid or a fraction thereof, prepared by the process as claimed 
in claim 1. 

The use of the oil, lipid or fatty acid composition as claimed in claim 11 or of a 
transgenic organism in feed, foodstuffs, cosmetics or pharmaceuticals. 
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DESATURASE GENES AND THEIR USE 

The present invention relates, inter alia, to novel desaturases and to uses thereof. 

Over the last few years a number of microsomal and soluble fatty acid desaturases have 
been isolated from higher plants, most notably from Arabidopsis thaliana. This has 
resulted from a combined genetic and biochemical approach to the generation and 
complementation of mutant Arabidopsis lines defective in fatty acid desaturation or 
elongation (Somerville C, Browse J (1996) Trends Cell Biol 6, 148-1153). The importance 
of this approach has been validated by the isolation and characterisation of genes encoding 
microsomal desaturases such the A 12 (Okuley J, et al (1994), Plant Cell 6, 147-158) 
andA 15 (Arondel V, et al (1992) Science 258, 1353-1355) desaturases (encoded by the 
FAD2 and FAD3 genes respectively), enzymes which had previously proved intractable to 
classical purification techniques on account of their hydrophobicity. The isolation of these 
and related enzymes, such as the A 12 hydroxylase from Ricinus communis (van de Loo FN 
et al (1995) Proc. Natl Acad. Sci USA 92, 6743-6747), has allowed the identification of a 
number of conserved motifs in plant microsomal desaturases, most notably the so-called 
"histidine boxes' 1 (Shanklin, J et al (1997) Proc Natl Acad. Sci USA. 92, 6743-6747). 
Proteins containing these motifs can be classified as di-iron centre-containing enzymes 
(Shanklin, J et al (1997) Proc. Natl Acad Sci. USA 94, 2981-1986). 

W093/11245 (Du Pont) discloses various nucleic acid fragments encoding desaturases, 
particularly A 12 and A 15 desaturases, which have been isolated from various plants. 
Recently a cDNA clone was isolated from the plant borage, (Borago officinalis) which 
accumulates y-linoleic acid (GLA), using highly degenerate PCR against these histidine 
motifs. US56 14393 (Rhone-Poulenc Agrochimie) discloses and claims the nucleotide 
sequence of borage A 6 desaturase. Whilst the specification suggests that A 6 
desaturase-encoding nucleic acids might be isolated from animal cells without difficulty by 
the skilled person no suitable animal cells are suggested (in contrast to suggested fungal 
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and bacterial cells) and there is no disclosure of the isolation of such nucleic acids from 
animal cells. The isolated DNA clone was shown by heterologous expression in transgenic 
tobacco to encode a microsomal A 6 desaturase (Sayanova O et al (1997) Proc. Natl. Acad. 
Sci USA. 94, 4211-4216). Desaturation at the A 6 position is an unusual modification in 
higher plants, occurring only in a small number of species such as borage, evening 
primrose (Oenothera spp.) and redcurrant (Ribes spp.), which accumulate the A 6 - 
unsaturated fatty acids GLA and octadecatetraenoic acid (OTAIS: 4 * 691215 - also known as 
stearidonic acid) in the seeds and/or leaves. 

GLA is a high value plant fatty acid, and is widely used in the treatment of a number of 
medical conditions, including eczema and mastalgia. It has been postulated that the 
application of GLA replaces the loss of, or meets an increased requirement for, endogenous 
A 6 -unsaturated fatty acids (Horrobin, D.F. (1990) Rev. Contemp. Pharmacother. 1: 1-45). 

For reference purposes Figure 5 is provided to show in simplified form a metabolic 
pathway believed to occur in certain organisms (including humans) and involving A 6 
desaturates. It can be seen that GLA can be synthesised in vivo from linoleic acid under the 
action of a A 6 desaturase and that GLA can be used to synthesise dihomo-GLA, which can 
be converted to arachidonic acid under the influence of a A 5 desaturase. Arachidonic acid 
is a precursor of various important eicosanoids (including prostaglandins and leucotrienes). 

A 6 desaturase also converts a linoleic acid into OTA. Thus it is clear that the A 6 desaturase 
is the first committed step on the biosynthetic pathway of these biologically active 

molecules (see Fig. 5). 

The sequence of the previously isolated borage microsomal A 6 desaturase differs from 
previously characterised plant microsomal desaturases/hydroxylases in that it contains an 
N-terminal extension which shows homology to cytochrome b 5 , and also in that the third 
(most C-terminal) histidine box varies from the consensus (Shanklin J et al (1997) Proc. 
Natl. Acad. Sci. USA 94, 2981-1986) H-X-X-H-H, with a glutamine replacing the first 
histidine. This was also observed in the case of the cyanobacteria Synechocystis A 6 
desaturase (GenBank ID; LI 1421). WO93/06712 (Rhone Poulenc Agrochimie) discloses 
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an isolated nucleic acid encoding a A 6 desaturase isolated from the Synechocystis, and 
claims bacterial A 6 desaturases and their uses. 

Although A 6 fatty acid desaturation is an unusual modification in higher plants, it is 
believed to be common in animals. The essential fatty acid linoleic acid (18:2 A 912 ) is 
desaturated to GLA by a A 6 desaturase as a first step in the biosynthetic pathway of the 
eicosanoids (which include prostagladins and leucotrienes). This results in the rapid 
metabolism of GLA (to di-homo-GLA and arachidonic acid; i.e. 20:3A 81114 and 20:4 
^5,8.11.14 reS p ec tively). Accumulation of GLA is therefore not usually observed. 

The nematode worm Caenorhabitis elegans is extremely useful in that it has well 
understood genetics and has many similarities with higher animals such as humans and is 
therefore extremely useful in the development of desaturases for use in such animals. 

According to the present invention, there is provided a polypeptide having desaturase activity, 
which comprises the amino acid sequence shown in Figure 1 . 

The amino acid sequence shown in Figure 1 is that of a A 6 desaturase that is present in the 
nematode worm Caenorhabditis elegans. This is highly significant since prior to the present 
invention no successful sequencing or purification of an animal A 6 desaturase had been 
reported. As C. elegans does not accumulate GLA isolation of a A 6 desaturase from it was an 
unexpected target for isolating desaturases gene in. 

The desaturase of the invention is significantly different from known desaturases. The 
homology between the A 6 desaturase of the invention and the microsomal A 12 and A 15 
desaturases from Arabidopsis described in W093/11245 are 24% and 16% respectively as 
determined using the BESTFTT program. The A 6 desaturase gene of the present invention 
shows 21% identity with the C elegans FAT-1 desaturase described in Spychalla, J. P. et al 
Proc. Natl Acad. Sci 94 1 142-1 147 paper. The sequence homology between the A 6 desaturase 
of the present invention and the Synechochocystis A 6 described in WO93/067 12 is only 23%. 

According to another aspect of the invention there is provided therefore an isolated animal A 6 
desaturase. 
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The amino acid sequence shown in Figure 1 is also of significance because it has a very low 
level of sequence identity with the borage A 6 desaturase (the only other eukaryotic A 6 
desaturase to have been sequenced prior to the present invention). Indeed, this level of 
sequence identity is below 32 %. At such a low level of identity it might be expected that the 
two polypeptides would have completely different functions. Unexpectedly, both have A 6 
desaturase activity. 

The present invention is, however, not limited to a A 6 desaturase having the sequence shown 
in Figure 1. It also includes other desaturases having at least 32% sequence identity therewith. 
Preferred polypeptides of the present invention have at least 40 % or more preferably at least 
50% amino acid sequence identity therewith. More preferably the degree of sequence identity 
is at least 75%. Sequence identities of at least 90%, at least 95% or at least 99% are most 
preferred. 

For the purposes of the present invention, sequence identity (whether amino acid or 
nucleotide) can be determined by using the "BESTFTT" program of the Wisconsin Sequence 
Analysis Package GCG 8.0. 

Where high degrees of sequence identity are present there may be relatively few differences 
in amino acid sequence. Thus for example there may be less than 20, less than 10, or even 
less than 5 differences. 

Fragments of the polypeptides described above are also within the scope of the present 
invention, provided that they have desaturase activity, that is to say they have the ability to 
introduce a double bond into a substrate at a specific position as determined by GCMS. What 
is the lowest limit for activity. These fragments are preferably at least 100 amino acids long 
More preferably, the fragments are at least 150 amino acids long. 

In summary, a polypeptide of the present invention has desaturase activity and: 
a) comprises the amino acid sequence shown in Figure 1 ; 
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b) has one or more amino acid deletions, insertions or substitutions relative to a 
polypeptide as defined in a) above, but has at least 32% amino acid sequence identity 
therewith; or 

c) is a fragment of a polypeptide as defined in a) or b) above, which is at least 100 amino 
acids long. 

The term "polypeptide" is used herein in a broad sense to indicate that a particular molecule 
comprises a plurality of amino acids joined together by peptide bonds. It therefore includes 
within its scope substances, which may sometimes be referred to in the literature as 
peptides, polypeptides or proteins. 

Desirably a polypeptide of the present invention will have a cytochrome domain. A 
cytochrome domain can be defined as an electron-transporting domain that contains a heme 
prosthetic group. Preferably a cytochrome b domain is present. More preferably a cytochrome 
b 5 domain is present (desirably this includes a H-P-G-G-X15-F-X3-6-H, where X is any amino 
acid, motif). A cytochrome b 5 domain is present in both the borage A 6 desaturase and in the 
C. elegans A 6 desaturase amino acid sequence shown in Figure 2B The cytochrome b 5 domain 
is preferably an N-terminal domain - i.e. it is closer to the N-terminal end of the desaturase 
than to the C-terminal end. This contrasts with other desaturases. For example, yeast A 9 
desaturase, has a C-terminal cytochrome b 5 domain and plant A 12 and A 15 desasturases which 
do not have any b 5 domain. 

A polypeptide of the present invention preferably has one or more (most preferably three) 
histidine boxes. One of these may have an H-»Q substitution. (This provides a variant 
histidine box that is believed to be conserved over a range of animal / plant species.) 

Polypeptides of the present invention can have any regiospecificity including cisltrans 
activity although it is preferred that they are front end desaturases that introduce a double 
bond between the C3 and C7 positions, measured from the COOH (A end) of the group. A 
skilled person is readily able to distinguish between different desaturases by determining the 
different positions of double bonds introduced by the desaturases. This can be done by known 
analytical techniques e.g. by using gas chromatography and mass spectrometry. 
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Particularly preferred desaturases of the invention are A 6 desaturases. 

Desirably the desaturases occur naturally in one or more organisms that do not accumulate 
GLA (i.e. where GLA may be produced, but is not normally detectable because it is very 
quickly metabolised). Such desaturases may occur naturally in one or more animals. The 
desaturases occur naturally in one or more nematodes, e.g. in C. elegans. 

In order to appreciate the scope of the present invention more fully, polypeptides within the 
scope of each of a), b) and c) above will now be discussed in greater detail. 

Polypeptides within the scope of a) 

A polypeptide within the scope of a) may consist of the amino acid sequence shown in Figure 
1 or may have an additional N-terminal and/or an additional C-terminal amino acid sequence. 

Additional N-terminal or C-terminal sequences may be provided for various reasons and 
techniques for providing such additional sequences are well known in the art. Such 
techniques include using gene-cloning techniques whereby nucleic acid molecules are ligated 
together and are then used to express a polypeptide in an appropriate host. 

Additional sequences may be provided in order to alter the characteristics of a particular 
polypeptide. This can be useful in improving expression or regulation of expression in 
particular expression systems. For example, ah additional sequence may provide some 
protection against proteolytic cleavage. 

Additional sequences can also be useful in altering the properties of a polypeptide to aid in 
identification or purification. For example, a signal sequence may be present to direct the 
transport of the polypeptide to a particular location within a cell or to export the polypeptide 
from the cell. Different signal sequences can be used for different expression systems. 

Another example of the provision of an additional sequence is where a polypeptide is linked 
to a moiety capable of being isolated by affinity chromatography. The moiety may be an 
epitope and the affinity column may comprise immobilised antibodies or immobilised 
antibody fragments that bind to said epitope (desirably with a high degree of specificity). The 
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resultant fusion protein can usually be eluted from the column by addition of an appropriate 
buffer. 

Additional N-terminal or C-terminal sequences may, however, be present simply as a result of 
a particular technique used to obtain a polypeptide of the present invention and need not 
provide any particular advantageous characteristic. 

Polypeptides within the scope ofb) 

Turning now to the polypeptides defined in b) above, it will be appreciated that these are 
variants of the polypeptides given in a) above. 

Various changes can often be made to the amino acid sequence of a polypeptide which has a 
desired property in order to produce variants which still have that property. Such variants of 
the polypeptides described in a) above are within the scope of the present invention and are 
discussed in greater detail in sections (i) to (Hi) below. They include allelic and non-allelic 
variants. 

(i) Substitutions 

An example of a variant of the present invention is a polypeptide as defined in a) above, apart 
from the substitution of one or more amino acids with one or more other amino acids. 

The skilled person is aware that various amino acids have similar characteristics. One or 
more such amino acids of a polypeptide can often be substituted by one or more other such 
amino acids without eliminating a desired property of that polypeptide (such as desaturase 
activity). 

For example, the amino acids glycine, alanine, valine, leucine and isoleucine can often be 
substituted for one another (amino acids having aliphatic side chains). Of these possible 
substitutions it is preferred that glycine and alanine are used to substitute for one another 
(since they have relatively short side chains) and that valine, leucine and isoleucine are used 
to substitute for one another (since they have larger aliphatic side chains which are 
hydrophobic). Other amino acids that can often be substituted for one another include 
phenylalanine, tyrosine and tryptophan (amino acids having aromatic side chains); lysine, 
arginine and histidine (amino acids having basic side chains); aspartate and glutamate (amino 
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acids having acidic side chains); asparagine and glutamine (amino acids having amide side 
chains); and cysteine and methionine (amino acids having sulphur containing side chains). 

Substitutions of this nature are often referred to as "conservative" or "semi-conservative" 
amino acid substitutions. 

(ii) Deletions 

Amino acid deletions can be advantageous since the overall length and the molecular weight 
of a polypeptide can be reduced whilst still retaining a desired activity. This can enable the 
amount of polypeptide required for a particular purpose to be reduced. 

(Hi) Insertions 

Amino acid insertions relative to a polypeptide as defined in a) above can also be made. This 
may be done to alter the nature of the polypeptide (e.g. to assist in identification, purification 
or expression). 

Polypeptides incorporating amino acid changes (whether substitutions, deletions or 
insertions) relative to the sequence of a polypeptide as defined in a) above can be provided 
using any suitable techniques. For example, a nucleic acid sequence incorporating a desired 
sequence change can be provided by site-directed mutagenesis. This can then be used to 
allow the expression of a polypeptide having a corresponding change in its amino acid 
sequence. 

Polypeptides within the scope ofc) 

As discussed supra, it is often advantageous to reduce the length of a polypeptide. Feature c) 
of the present invention therefore covers fragments of the polypeptides a) or b) above which 
arc at least 100 amino acids long, but which do not need to be as long as the full length 
polypeptide shown in Figure 1. Desirably these fragments are at least 200, at least 300 or at 
least 400 amino acids long. 

Various uses of the polypeptides of the present invention will now be described by way of 
example only. 
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Polypeptides of the present invention may be used, inter alia, in obtaining useful molecules. 
For example A 6 desaturases can be used in obtaining y-linolenic acid (GLA) or in obtaining 
metabolites in respect of which GLA is a precursor. For example, octadecatetraenoic acid 
(OTA; 18:4A 6 ' 9,12,15 ), a member of the n-3 (or co-3) fatty acids may be produced by the 
A 6 -desaturation of a-linolenic acid. 

GLA, OTA and their metabolites are useful in medicine. They can be used in the preparation 
of a medicament for treating a disorder involving a deficiency in GLA or of a metabolite 
derived in vivo from GLA (e.g. an eicosanoid). Disorders which may be treated include 
eczema, mastalgia, hypercholesterolemia, atherosclerosis, coronary disease, diabetic 
neuropathy, viral infections, acne, hypertension, cirrhosis and cancer. 

The metabolites may be produced in vivo in suitable hosts or in vitro. 

When a metabolite is to be produced in vitro, a desaturase of the present invention and its 
substrate will normally be provided separately and then combined when it is desired to 
produce the metabolite. The present invention therefore includes within its scope a method of 
making GLA or OTA comprising using a A 6 desaturase of the present invention to convert 
linoleic acid substrate or a-linolenic acid substrate to GLA or OTA respectively. 

When a metabolite is to be produced in vivo in a organism such as a plant or animal, the 
substrate for a desaturase of the present invention will normally be provided by the relevant 
non-human organism itself. In vivo production of the metabolite can therefore be achieved by 
inserting a gene encoding a desaturase of the present invention into the organism and 
allowing the organism to express the desaturase. The desaturase can then act on its substrate. 
It will therefore be appreciated that polypeptides of the present invention can be used to 
provide desaturase activity in organisms that would normally not possess such activity or to 
increase the level of desaturase activity in organisms already having some desaturase activity. 
If desired, a useful metabolite may be purified from such an organism. Alternatively the 
organism itself may be used directly as a source of the metabolite. Particular cloning 
techniques that can be used to provide transgenic organisms with desaturase activity are 
discussed later on. 
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Polypeptides of the present invention can also be used as indicators of the transformation of 
an organism. For example, if an organism intended to be transformed does not have a 
particular desaturase and a nucleic acid intended for use in transformation encodes that 
desaturase, an assay can be performed after attempted transformation to determine whether or 
not the desaturase is present. Thus, in the case of the A 6 desaturase, an assay for the presence 
of GLA may be performed and GLA can serve as a simple marker for the presence of a 
functional transgene cassette comprising a A 6 desaturase encoding sequence. 

A further use of the present invention is in providing antibodies. The present invention 
includes within its scope antibodies that bind to polypeptides of the present invention. 

Preferred antibodies bind specifically to polypeptides of the present invention and can 
therefore be used to purify such polypeptides. (For example, they may be immobilised and 
used to bind to polypeptides of the present invention. The polypeptides may then be eluted 
by washing with a suitable eluent under appropriate conditions.) 

An antibody or a derivative thereof within the scope of the present invention may be used in 
diagnosis. For example binding assays using such an antibody or a derivative can be used to 
determine whether or not a particular desaturase is present. This is useful in diagnosing 
disorders that arise due to the absence of the functional desaturase. 

Antibodies within the scope of the present invention may be monoclonal or polyclonal. 

Polyclonal antibodies can be raised by stimulating their production in a suitable animal host 
(e.g. a mouse, rat, guinea pig, rabbit, sheep, goat or monkey) when a polypeptide of the 
present invention is injected into the animal. If necessary an adjuvant may be administered 
together with a polypeptide of the present invention. The antibodies can then be purified by 
virtue of their binding to a polypeptide of the present invention. 

Monoclonal antibodies can be produced from hybridomas. These can be formed by fusing 
myeloma cells and spleen cells which produce the desired antibody in order to form an 
immortal cell line. Thus the well-known Kohler & Milstein technique {Nature 256 52-55 
(1975)) or variations upon this technique can be used. 



WO 99/27111 




PCT/GB98/03507 



Techniques for producing monoclonal and polyclonal antibodies that bind to a particular 
polypeptide are now well developed in the art. They are discussed in standard immunology 
textbooks, for example in Roitt et al, Immunology second edition (1989), Churchill 
Livingstone, London. 

In addition to whole antibodies, the present invention includes derivatives thereof which are 
capable of binding to polypeptides of the present invention. Thus the present invention 
includes antibody fragments and synthetic constructs. Examples of antibody fragments and 
synthetic constructs are given by Dougall et al in Tibtech 12 372-379 (September 1994). 

Antibody fragments include, for example, Fab, F(ab') 2 and Fv fragments. (These are 
discussed, for example, in Roitt et al (supra)). Fv fragments can be modified to produce a 
synthetic construct known as a single chain Fv (scFv) molecule. This includes a peptide 
linker covalently joining V h and Vi regions, which contributes to the stability of the molecule. 
Other synthetic constructs that can be used include CDR peptides. These are synthetic 
peptides comprising antigen-binding determinants. Peptide mimetics may also be used. 
These molecules are usually conformationally restricted organic rings that mimic the structure 
of a CDR loop and that include antigen-interactive side chains. 

Synthetic constructs include chimaeric molecules. Thus, for example, humanised (or 
primatised) antibodies or derivatives thereof are within the scope of the present invention. An 
example of a humanised antibody is an antibody having human framework regions, but 
rodent hypervariable regions. 

Synthetic constructs also include molecules comprising an additional moiety which provides 
the molecule with some desirable property in addition to antigen binding. For example the 
moiety may be a label (e.g. a fluorescent or radioactive label). Alternatively, it may be a 
pharmaceutical^ active agent. 

The present invention also includes nucleic acid molecules within its scope. 
Such nucleic acid molecules: 

a) code for a polypeptide according to the present invention; or 
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b) are complementary to molecules as defined in a) above; or 

c) hybridise to molecules as defined in a) or b) above. 

These nucleic acid molecules and their uses are discussed in greater detail below: 

The polypeptides of the present invention can be coded for by a large variety of nucleic acid 
molecules, taking into account the well-known degeneracy of the genetic code. All of these 
coding nucleic acid molecules are within the scope of the present invention. Preferred coding 
nucleic acid molecules encode the polypeptide shown in Figure 1. These include nucleic acid 
molecules comprising the coding sequence shown in Figure 1 and degenerate variants 
thereof. 

The nucleic acid molecules may be used directly. Alternatively they may be inserted into 
vectors. 

Nucleic acids or vectors containing them may be used in cloning. They may be introduced 
into non-human hosts to enable the expression of polypeptides of the present invention using 
techniques known to those skilled in the art. Alternatively, cell free expression systems may 
be used. 

Techniques for cloning, expressing and purifying polypeptides are well known to the skilled 
person. Various such techniques are disclosed in standard text-books, such as in Sambrook et 
al (Molecular Cloning 2nd Edition, Cold Spring Harbor Laboratory Press (1989)); in Old & 
Primrose (Principles of Gene Manipulation, 5th Edition, Blackwell Scientific Publications 
(1994)); and in Stryer (Biochemistry, 4th Edition, W H Freeman and Company (1995)). 

By using an appropriate expression system the polypeptides can be produced in a desired 
form. For example, the polypeptides can be produced by micro-organisms such as bacteria 
or yeast, by cultured insect cells (which may be baculovirus-infected), or by mammalian 
cells (such as CHO cells). 

However preferred hosts are plants or plant propagating material e.g. oil seed rape, 
sunflower, cereals including maize, tobacco, legumes including peanut and soybean, 
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safflower, oil palm, coconut and other palms, cotton, sesame, mustard, linseed, castor, 
borage and evening primrose, or propagating material therefor. 

The technology for providing plants or plant propagating material is now well developed. It is 
briefly discussed in WO 96/21022, for example. Desaturases isolated from animals have 
successfully been expressed in plants. For example, Spychalla, J.P. et al, (supra) describe 
the expression of a C. elegans desaturase in transgenic Arabidopsis. Additionally, 
EP0550162 (Pioneer Hi-Bred International, Inc) discloses a chimaric gene construct 
encoding a A 9 desaturase isolated from rat, and plants transformed with the construct for 
the production of fatty acids. The desaturase described in that publication has only 22% 
identity with the A 6 desaturase of the present invention. 

Particular techniques that can be used are discussed below. It will of course be appreciated 
that such techniques are non-limiting. 

(i) Vector systems based on Agrobacterium tumefaciens. 

These include Ti based systems, such as pGV3850, in which the T-DNA has been disarmed. 
Desirably a selectable marker is present (e.g. a marker that provides resistance to an 
antibiotic). 

Intermediate vectors (IVs) may also be used. They tend to be small in size and are therefore 
usually easier to manipulate than large Ti based vectors. IVs are generally vectors resulting 
from T-DNA having been cloned into £. coli derived plasmid vectors, such as pBR322. IVs 
are often conjugation-deficient and therefore a conjugation-proficient plasmid (such as 
pRK2013) may be used to mobilise an IV so that it can be transferred to an Agrobacterium 
recipient. In vivo homologous recombination can then occur in an Agrobacterium to allow an 
IV to be inserted into a resident, disarmed Ti plasmid in order that a cointegrate can be 
produced that is capable of replicating autonomously in the Agrobacterium. 

Another alternative is to use binary Ti vectors. Here a modified T-DNA region carrying 
foreign DNA can be provided on a small plasmid that replicates in E. coli (e.g. pRK252). 
This plasmid (sometimes called mini-Ti or micro-Ti) can then be transferred conjugatively 
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via a tri-parental mating into an A. tumefaciens that contains a compatible vir gene 
(providing the vir function in trans). 

Binary vectors without Ti sequences may even be used. Here bacterial mob and orfT 
functions may be used to promote plasmid transfer. Again, the vir function may be provided 
in trans . 

The vector systems discussed above can be used to transfer genes into plants by using the 
protocol of Horsch et al. {Science 227, 1229-31 (1985)) or variants thereof. Here small discs 
can be punched from the leaves of a dicotyledenous plant, they can be surface-sterilised, and 
can then be placed in a medium including A. tumefaciens that contains recombinant T-DNA 
in which a foreign gene to be transferred is accompanied by a selectable marker (e.g. the neo 
gene). The discs can then be cultured for 2 days and then transferred to a medium for 
selecting the selectable marker. (This can be done for a neo selectable marker by culturing 
using a medium containing kanamycin). A. tumefaciens can be killed by using a carbenicillin 
containing medium. Shoots will normally develop from a callus after 2-4 weeks. They can 
then be excised and transplanted to root-inducing medium and, when large enough can be 
transplanted into soil. 

(ii) Vector systems based on Agrobacterium rhizogenes 

These include Ri derived plasmids. Ri T-DNA is generally considered not to be deleterious 
and therefore such plasmids can be considered as equivalent to disarmed Ti plasmids. An IV 
co-integrate system based on Ri plasmids has been developed. 

(Hi) Plant protoplast based transformation systems 

Suitable techniques are described in "Plant Gene Transfer and Expression Protocols" ed. H. 
Jones, Human Press Methods in Molecular Biology, 49, 1995. 

Transformation of plants can be facilitated by removing plant cell walls to provide 
protoplasts. The cell walls can be removed by any suitable means, including mechanical 
disruption or treatment with cellulolytic and pectinolytic enzymes. Protoplasts can then be 
separated from other components by centrifugation and techniques such as electroporation 
can then be used to transform the protoplasts with heterologous DNA. Under appropriate 
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culture conditions the transformed protoplasts will grow new cell walls and also divide. 
Shoots and roots can then be induced and plantlets formed. 

(iv) Transfection by biolistics 

High velocity microprojectiles carrying DNA or RNA can be used to deliver that DNA or 
RNA into plant cells. This has allowed a wide variety of transgenic plants to be produced and 
is suitable for both monocotyledenous and dicotyledenous plants. For example gold or 
tungsten particles coated with DNA or RNA can be used. Suitable devices for propelling the 
microprojectiles include gunpowder based devices, electric discharge based devices and 
pneumatic devices. 

(v) Virus based systems 

DNA plant virus vectors include cauliflower mosaic viruses (which infect a range of dicots.) 
and geminiviruses ( which infect a wide range of dicots. and monocots). RNA plant viruses 
are in the majority and include Brome Mosaic Virus (which infects a number of Graminae, 
including barley) and Tobacco Mosaic Virus (which infects tobacco plants). 

From the foregoing description it will be appreciated that nucleic acid molecules encoding 
polypeptides of the present invention can be cloned and expressed in a wide variety of 
organisms. 

In addition to nucleic acid molecules coding for polypeptides of the present invention 
(referred to herein as "coding" nucleic acid molecules), the present invention also includes 
nucleic acid molecules complementary thereto. Thus, for example, both strands of a double 
stranded nucleic acid molecule are included within the scope of the present invention 
(whether or not they are associated with one another). Also included are mRNA molecules 
and complementary DNA molecules (e.g. cDNA molecules). 



Nucleic acid molecules that can hybridise to one or more of the nucleic acid molecules 
discussed above are also covered by the present invention. Such nucleic acid molecules are 
referred to herein as "hybridising" nucleic acid molecules. 
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A hybridising nucleic acid molecule of the present invention may have a high degree of 
sequence identity along its length with a nucleic acid molecule within the scope of a) or b) 
above (e.g. at least 50%, at least 75% or at least 90% sequence identity). 

As will be appreciated by those skilled in the art, the greater the degree of sequence identity 
that a given single stranded nucleic acid molecule has with another single stranded nucleic 
acid molecule, the greater the likelihood that it will hybridise to a single stranded nucleic acid 
molecule which is complementary to that other single stranded nucleic acid molecule under 
appropriate conditions. 

Desirably hybridising molecules of the present invention are at least 10 nucleotides in length 
and preferably are at least 25, at least 50, at least 100 or at least 200 nucleotides in length. 

Preferred hybridising molecules hybridise under stringent hybridisation conditions. One 
example of stringent hybridisation conditions is where attempted hybridisation is carried out 
at a temperature of from about 35°C to about 65°C using a salt solution that is about 0.9 
molar. However, the skilled person will be able to vary such parameters as appropriate in 
order to take into account variables such as probe length, base composition, type of ions 
present, etc. 

Most preferably, hybridising nucleic acid molecules of the present invention hybridise to a 
DNA molecule having the coding sequence shown in Figure 1 to an RN A equivalent thereof, 
or to a complementary sequence to either of the aforesaid molecules. 

Hybridising nucleic acid molecules can be useful as probes or primers, for example. 

Probes can be used to purify and/or to identify nucleic acids. For example they can be used to 
identify the presence of all or part of a desaturase gene and are therefore useful in diagnosis. 

Primers are useful in amplifying nucleic acids or parts thereof, e.g. by PCR techniques. 

In addition to being used as probes or primers, hybridising nucleic acid molecules of the 
present invention can be used as antisense molecules to alter the expression of polypeptides 
of the present invention by binding to complementary nucleic acid molecules. (Generally this 
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can be achieved by providing nucleic acid molecules that bind to RNA molecules that would 
normally be translated, thereby preventing translation due to the formation of duplexes.) 

Hybridising molecules may also be provided as ribozymes. Ribozymes can also be used to 
regulate expression by binding to and cleaving RNA molecules that include particular target 
sequences recognised by the ribozymes. 

From the foregoing discussion it will be appreciated that a large number of nucleic acids are 
within the scope of the present invention. Unless the context indicates otherwise, nucleic acid 
molecules of the present invention may therefore have one or more of the following 
characteristics: 

1) They may be DNA or RNA (including variants of naturally occurring DNA or RNA 
structures, which have non-naturally occurring bases and/or non-naturally occurring 
backbones). 

2) They may be single or double stranded. 

3) They may be provided in recombinant form i.e. covalently linked to a heterologous 5* 
and/or 3' flanking sequence to provide a chimaeric molecule (e.g. a vector) which 
does not occur in nature. 

4) They may be provided without 5' and/or 3* flanking sequences that normally occur in 
nature. 

5) They may be provided in substantially pure form, e.g. by using probes to isolate 
cloned molecules having a desired target sequence or by using chemical synthesis 
techniques. Thus they may be provided in a form which is substantially free from 
contaminating proteins and/or from other nucleic acids. 



6) They may be provided with introns (e.g. as a full-length gene) or without introns (e.g. 
as cDNA). 
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The present invention will now be described by way of example only, with reference to the 
accompanying drawings, Figures 1 to 6 wherein: 

Fig 1 shows the DNA sequence and the deduced amino acid sequence of the full length 
C. elegans cDNA pCeD6.1 . The positions of the N-terminal cytochrome b 5 domain and the 
variant third histidine box are underlined. The deduced amino acid sequence of this cDNA 
is identical to that predicted for residues 1-38 and 68-473 of W08D2.4. 

Fig 2A shows a comparison of the deduced amino acid sequences of the C. elegans cDNA 
CeD6.1 and the C. elegans predicted protein W08D2.4. (MywormD6=CeD6.1; 
cew08d2=ORF W08D2.4.) 

Fig 2B shows a comparison of the deduced amino acid sequences of the borage A 6 
desaturase (Sayanova O et al (1997) Proc. Natl Acad. ScL USA 94, 421 1-4216) and the C. 
elegans cDN A CeD6. 1 . ( Boofd6=Borage officianalis A 6 desaturase; ceeld6=CeD6. 1 .) 

Fig 3 shows methyl esters of total lipids of 5. cerevisiae grown under inducing conditions 
(linololate and galactose). 

Fig 4 shows GC-MS analysis of the novel peak identified in yeast carrying pYCeD6.1. 

Fig 5 shows a simplified version of the metabolism of n-6 essential fatty acids in mammals. 

Fig 6 shows fatty acid and methyl esthers of leaf material from either control transformed 
Arabidopsis plant (A) or transformed Arabidopsis plant expressing the C. elegansA 6 
desaturase (B). 

Example 1 - Isolation of A 6 Desaturase Gene and Expression in Yeast 

The NCBI EST sequence database was searched for amino acid sequences using a known 
borage A 6 fatty acid desaturase (Sayanova O et al (1997) supra) and limiting the search to 
sequences containing a variant histidine box Q-X-X-H-H. 
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C elegans ESTs were identified. They were further characterised by searching the 
C. elegans EST project database (Prof. Y. Kohara lab (National Institute of Genetics, 
Mishima, Japan); DNA Database of Japan) to identify related cosmid clones. 

A partial 448 base pair cDNA clone designated as yk436bl2 identified by these searches 
was obtained from the C elegans EST project, and this was used to screen a C. elegans 
cDNA library (mixed stage; also supplied by Prof Kohara) This indicated that the clone 
yk436bl2 was homologous to part of a gene present on cosmid W08D2 (Genbank 
accession number Z70271), which forms part of chromosome IV. Bases 21-2957 of cosmid 
W0D2 are predicted by the protein prediction program Genefinder (Wilson R et al (1994) 
Nature 368 32-38 to encode an ORF of 473 residues which is interrupted by 5 introns. 
Wilson, R. et al disclose part of the sequence of chromosome HI of C. elegans. A number 
of positives were identified and further purified, and full length clones were confirmed by 
sequencing to encode a transcript likely to have been transcribed from the gene designated 
W08D2.4, on cosmid W08D2, as determined by database searching of the genes sequenced 
by the C. elegans genome project. 

Examination of this predicted polypeptide (designated W08D2.4 by the Sanger Centre 
Nematode Sequencing Project, Hinxton, UK) revealed that it had a number of 
characteristics reminiscent of a microsomal fatty acid desaturase, including three histidine 
boxes^ However, the predicted protein sequence indicated the presence of an N-terminal 
domain similar to cytochrome b 5 , containing the diagnostic H-P-G-G motif found in 
cytochrome b 5 proteins (Lederer F (1994) Biochimie. 76, 674-692). Since the A 6 desaturase 
isolated by us from borage also contained an N-terminal b 5 domain, this indicated that 
W0D2.4 may encode a D 6 desaturase. 

Closer examination of the sequence revealed the presence of the variant third histidine box, 
with an H-»Q substitution (again as observed in the borage A 6 desaturase). The degree of 
similarity between W08D2.4 and the borage A 6 desaturase is <52% and is therefore low. 
The figure of <31% obtained for identity is also low. 



WO 99/27111 W W PCT/GB98/03507 

20 

Since W08D2.4 was encoded by a gene containing many (6) introns, it was necessary to 
isolate a full length cDNA to verify the sequence predicted by the Genefinder program , 
and to also allow the expression of the ORF to define the encoded function. 

A cDNA library was screened with the EST insert yk436bl2 (generously provided by 
Prof Y. Kohara) and a number of positive plaques were identified. These were further 
purified to homogeneity, excised, and the largest inserts (of -1450 bp) from the resulting 
rescued phagemids were sequenced. This confirmed that the cDNAs isolated by us were 
indeed homologous to W08D2.4, with the 5' and 3' ends of the cDNA being equivalent to 
bases 9 and 3079 of the sequence of cosmid W08D2. Since the ATG initiating codon 
predicted by the Genefinder program to be the start of gene product W08D2.4 was indeed 
the first methionine in the cDNA clone, we reasoned that we had isolated a bona fide full 
length cDNA. The DNA sequence and deduced amino acid sequence of one representative 
cDNA clone (termed pCeD6.1; 1463 bp in length) is shown in Fig 1; the deduced amino 
acid sequence is identical to that predicted for W08D2.4 over the majority of the protein. 
The positions of the N-terminal cytochrome b 5 domain and the variant third histidine box 
are underlined. The deduced amino acid sequence of this cDNA is identical to that 
predicted for residues 1-38 and 68-473 of W08D2.4. 

However, DNA sequences encoding residues 38-67 (Y-S-I L-Y-F) predicted for 

W08D2.4 are not present in the cDNA clone. This means that the deduced amino acid 
sequence of CeD6.1 is in fact 443 amino acids long, as opposed to that predicted for 
W08D2.4, which is 473 residues in length. The only other difference between the two 
amino acid sequences is an M-»V substitution at residue 401 , resulting from an A->G base 
change (base 1211). The two sequences are compared in Fig 2 A, as is the_deduced amino 
acid sequence of the borage A 6 desaturase and that of CeD6. 1 (Fig 2B). The extra sequence 
predicted for W08D2.4 is likely to derived from incorrect prediction of intron-exon 
borders. 

Note the presence of the H-P-G-G cytochrome b 5 motif in the N-terminus (encoded by 
bases 96-108) and the H Q substitution in the third histidine box (encoded by bases 
1157-1172). 
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The coding sequence of W08D2.4 was introduced into the yeast expression vector pYES2 
by PCR. Oligonucleotides with 5' overhangs were used to introduce Kpnl and Sad sites at 
the 5' and 3' ends respectively. The fidelity of the construct was checked by in vitro 
transcription and translation using the TnT system (Promega). 

Specifically, clone pCeD6.1 was then used as a template for PCR amplification of the 
entire predicted coding sequence (443 amino acid residues in length), and cloned into the 
yeast expression vector pYES2 (Invitrogen) to yield pYCeD6. The fidelity of this 
PCR-generated sequence was checked in vitro transcription/translation of the plasmid, 
using the T7 RNA polymerase promoter present in pYES2. 

Using the Promega TnT coupled transcription/translation system,translation products were 
generated and analysed by SDS-PAGE and autoradiography as per the manufacturer's 
instructions. This revealed (data not shown) that the plasmid pYCeD6 generated a product 
of ~55kD, whereas the control (pYES2) failed to yield any protein products, indicating that 
the construct was correct. 

The resulting plasmid was introduced into yeast (S. cerevisiae) by the lithium acetate 
method (Guthrie C, Fink GR (1991) Meths Enz 194) and expression of the transgene was 
induced by the addition of galactose. The yeast was supplemented by addition of 0.2 mM 
linoleate (sodium salt) in the presence of 1% tergitol NP-40. 

Transformation and selection of yeast able to grow on uracil-deficient medium revealed 
yeast colonies carrying the recombinant plasmid pYCeD6 by virtue of the URA3 selectable 
marker carried by pYES2. Expression of pYCeD6 was obtained by inducing the GAL 
promoter that is present in pYES2. This was carried out after the cells had been grown up 
overnight with raffinose as a carbon source, and the medium supplemented by the addition 
of linoleate (18:2) in the presence of low levels of detergent. This later addition was 
required since the normal substrate for A 6 desaturation is 18:2 fatty acids, which do not 
normally occur in S. cerevisiae. 



Yeast total fatty acids were analyzed by GC of methyl esters . Confirmation of the presence 
of GLA was carried out by GC-MS (Sayanova et al (1997) supra). 
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In more detail, the cultures were then allowed to continue to grow after induction, with 
aliquots being removed for analysis by GC. When methyl esters of total fatty acids isolated 
from yeast carrying the plasmid pYCeD6 and grown in the presence of galactose and 
linoleate were analyzed by GC, an additional peak was observed (Fig 3). In Fig. 3 Panel A 
is yeast transformed with control (empty) vector pYES2, panel B is transformed with 
pYCeD6.1. The common fatty acid-methyl esters were identified as 16:0 (peak 1), 16:1 
(peak 2), 18:0 (peak 3), 18:1 (peak 4), 18:2 (peak 5; supplied exogenously). The additional 
peak (6) in panel B corresponds to 18:3 GLA, and is indicated by an arrowhead. This had 
the same retention time as an authentic GLA standard, indicating that the transgenic yeast 
were capable of A 6 -desaturating linoleic acid. No such peaks were observed in any of the 
control samples (transformation with pYES2). The identity of this extra peak was 
confirmed by GC-MS, which positively identified the compound as GLA (Fig 4). In the 
Figure 4 experiment, the sample was analyzed for mass spectra as before (Sayanova O et al 
(1997) Proc. Natl. Acad Sci. USA 94, 421 1-4216), and the data used to search a library of 
profiles. The sample was identified as GLA. A comparison of the mass spectra of the novel 
peak (A) and authentic GLA (B) is shown; visual and computer-based inspection revealed 
them to be identical. This confirms that CeD6.1 encodes a C. elegans A 6 desaturase, and 
that this cDNA is likely to be transcribed from the gene predicted to encode ORF 
W08D2.4, though the deduced amino acid sequence of CeD6.1 is 30 residues smaller than 
that ofW08D2.4 

Example 2 -Expression of C.elegans A 6 desaturase in plants 

The coding sequence of the C. elegans A 6 desaturates was subcloned into a plant 
expression vector pJD330, which comprises a viral 35S promoter, and a Nos terminator. 
The resulting cassette or promoter/coding sequence/terminator was then subcloned into the 
plant binary transformation vector pBin 19, and the resulting plasmid was introduced into 
Agrobacterium tumefaciens. This Agrobacterium strain was then used to transform 
Arabidopsis by the vacuum-infiltration of inflorescences. Seeds were harvested and plated 
onto selective media containing kanamycin. Since pBin 19 confers resistance to this 
antibiotic, only transformed plant material will grow. Resistant lines were identified and 
self-fertilized to produce homozygous material. Leaf material was analyzed for fatty acid 
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profiles using the same method as used for the expression of the nematode desaturase in 
yeast. Fatty acid methyl esthers were separated by GC, and novel peaks shown in Figure 6 
identified by comparison with known standards and GCMS. Two novel peaks can be seen 
in (B) which were identified asy -linolenic acid (peak 1) and octadecatetraenoic acid (peak 
2). These are the products of A 6 desaturation of the precursor fatty acids linoleic acid and a 
-linolenic acid, respectively. 

The inventors have shown that a C. elegans cDNA (CeD6.1) encodes a A 6 desaturase, and 
that this sequence is identical with the predicted ORF W08D2.4, except for a 30 residue 
insertion present in the N-terminal region of the latter protein. Whether the deduced amino 
acid sequence predicted for CeD6.1 represents a splicing variant of W08D2.4, or is a result 
of a mis-prediction of the intron/exon junctions by the Genefinder programme is unclear. 
However it is clear that CeD6. 1 encodes a A 6 desaturase. 

The ORF encoded by the this C. elegans sequence appears to be related to the higher plant 
A 6 fatty acid desaturase previously isolated by us (Sayanova O et al (1997) supra), in that 
they both contain N-terminal domains which show homology to cytochrome b 5 . 
Microsomal fatty acid desaturases have been demonstrated to use free microsomal 
cytochrome b 5 as their electron donor (Smith MA, et al (1990) Biochem. J, 272, 23-29, 
Smith MA et al (1992) Biochem. J. 287, 141-144)), and the vast majority of identified 
sequences for these enzymes appear not to contain this additional cytochrome b 5 domain 
(Okuley J et al (1994) Plant Cell 6, 147-158, Aronel V. et al (1992) Science 258, 
1353-1355 and Napier, J.A. et al (1997) Biochemical 7, 328:717-8). 

Prior to the present invention only two examples of cytochrome b 5 -domain-containing 
desaturases were known, one being the borage A 6 desaturase, and the other being the yeast 
microsomal A 9 (OLE1) desaturase (Napier J A et al (1997) Biochemical 7, supra and 
Mitchell AG, Martin CE (1995) 7. Biol Chem 270, 29766-29772). OLE1, however, 
contains a C-terminal cytochrome b 5 domain (Napier J A et al (1997) Biochemical 7, in 
press and Mitchell AG, Martin CE (1995) 7. Biol. Chem. 270, 29766-29772). The reason 
for the cytochrome b 5 may be that the A 6 desaturase is a "front-end" desaturase. (A 
"front-end" desaturation can defined as the final desaturation reaction on the fatty acid 
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chain, usually introducing double bonds between a pre-existing bond and the A-end of the 
carboxy group (Mitchell AG, Martin CE (1995) J. Biol. Chem 270, 29766-29772 and 
Aitzetmuller K, Tseegsuren, N (1994) /. Plant Physiol 143, 538-543).) 

In any event, it is now believed to be the case that both a variant histidine box and an 
N-terminal cytochrome b 5 domain are conserved in both animals and plants, as evidenced 
by their presence in both the borage and nematode A 6 desaturases. 

The invention may therefore allow the identification of other A 6 desaturases and also other 
"front-end" desaturases to be identified by the presence of these motifs. 
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1 . A polypeptide having desaturase activity, which: 

a) has the amino acid sequence shown in Figure 1 

b) has one or more amino acid deletions, insertions or substitutions relative to a 
polypeptide as defined in a) above, but has at least 32% amino acid sequence identity 
therewith; or 

c) is a fragment of a polypeptide as defined in a) or b) above, which is at least 100 amino 
acids long. 

2. A polypeptide according to claim 1 , which has a cytochrome domain. 

3. A polypeptide according to claim 2, which has a cytochrome b 5 domain. 

4. A polypeptide according to any preceding claim, which has at least one histidine box. 

5. A polypeptide according to any preceding claim, which has three histidine boxes. 

6. A polypeptide according to any preceding claim, which is a front end desaturase. 

7. A polypeptide according to any preceding claim, which is a A 6 desaturase. 

8. A polypeptide according to any preceding claim, which occurs naturally in an organism 
that does not accumulate GLA. 

9. A polypeptide according to any preceding claim, which occurs naturally in a eukaryote. 

10. A polypeptide according to any preceding claim, which occurs naturally in an animal. 

1 1 . A polypeptide according to any preceding claim, which occurs naturally in a nematode. 

12. A polypeptide according to any preceding claim, which occurs naturally in C. elegans. 
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13. A polypeptide according to claim 1, which consists of the amino acid sequence shown in 
Figure 1 or of a part thereof. 

14. A polypeptide comprising a polypeptide according to any preceding claim, when 
covalently linked to another moiety. 

15. The use of a polypeptide according to any of claims 1 to 14 in raising or selecting 
antibodies. 

16. The use of a polypeptide according to any of claims 1 to 14 as a marker for 
transformation. 

17. The use of a polypeptide according to claim 16 as a marker for plant transformation. 

18. An antibody or a derivative thereof which binds to a polypeptide according to any of 
claims 1 to 14. 

19. An antibody or a derivative thereof according to claim 18, for use in diagnosis. 

20. A method for assessing whether or not an organism has a polypeptide according to any of 
claims 1 to 14, comprising determining whether or not the organism has a polypeptide 
that binds to an antibody or a derivative thereof according to claim 18. 

21 . A method according to claim 20 in which the organism is a human. 

22. A method according to claim 20 or 21 preferred in vitro. 

23. A polypeptide according to any of claims 1 to 14, for use in medicine. 

24. The use of a polypeptide according to any of claims 1 to 14 in the preparation of a 
medicament for treating a disorder involving a deficiency in GLA in a metabolite derived 
in vivo from GLA. 

25. The use of a polypeptide according to claim 23 in which the metabolite is an eicosanoid. 
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26. The use according to claim 23, 24, or 25 wherein the disorder is eczema, mastalgia, 
hypercholesterolemia, atherosclerosis, coronary disease, diabetic neuropathy, viral 
infections, acne, cirrhosis, hypertension and cancer. 

27. A method of making GLA comprising using a polypeptide according to any one of claims 
1 to 14 to convert linoleic acid to GLA 

28. A method of making OTA comprising using a polypeptide according to any one of claims 
1 to 14 to convert a linoleic acid to OTA. 

29. A nucleic acid molecule which: 

a) codes for a polypeptide according to any of claims claim 1 to 14, 

b) is the complement of a nucleic acid molecule as defined in a) above, or 

c) hybridises to a nucleic acid molecule as defined in a) or b) above. 

30. A vector comprising a nucleic acid molecule according to claim 29. 

31. A host comprising a nucleic acid molecule according to claim 27 or a vector according to 
claim 30. 

32. A host according to claim 31, which is a plant or plant propagating material. 

33. A host according to claim 3 1 or claim 32, which is oil seed rape, sunflower, cereals 
including maize, tobacco, legumes including peanut and soybean, safflower, oil palm, 
coconut and other palms, cotton, sesame, mustard, linseed, castor, borage and evening 
primrose; or which is propagating material for any of the aforesaid. 

34. A method for obtaining a polypeptide according to any of claims 1 to 14, comprising 
incubating a host according to any of claims 31 to 33 under conditions causing expression 
of said polypeptide and then purifying said polypeptide. 



35. The use of nucleic acid molecule according to claim 29 as a probe or as a primer. 
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36. The use of a nucleic acid molecule according to claim 29 or a vector according to claim 
30 for preparing an organism that accumulates GLA or a metabolite derived from GLA 
in that organism. 

37. The use of a nucleic acid molecule according to claim 29 or a vector according to claim 
30 for preparing an that is chill resistant . 

38. A method of producing a host according to any of claims 31 to 33, comprising 
incorporating a nucleic acid according to claim 29 or a vector according to claim 30 into 
an organism. 
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10 30 50 

GCTCACCAAAATGGTCGTCGACAAGAATGCCTCCGGGCTTCGAATGAAGGTCGATGGCAA 
MVVDKNASGLRMKVDGK 

70 90 HO 

ATGGCTCTACCTTAGCGAGGAATTGGTGAAGAAACATCCAGGAGGAGCTGTTATTGAACA 
WLYLSEELVKK H P G G A V I E Q 

130 150 170 

...» • 
ATATAGAAATTCGGATGCTACTCATATTTTCCACGCTTTCCACGAAGGATCTTCTCAGGC 
YRNSDATHIFHAFHEGSSQA 

190 210 230 

...» * 
TTATAAGCAACTTGACCTTCTGAAAAAGCACGGAGAGCACGATGAATTCCTTGAGAAACA 
YKQLDLLKKHGEHDEFLEKQ 

2S0 270 290 

ATTGGAAAAGAGACTTGACAAAGTTGATATCAATGTATCAGCATATGATGTCAGTGTTGC 
LEKRLDKVDINVSAYDVSVA 

310 330 350 

ACAAGAAAAGAAAATGGTTGAATCATTCGAAAAACTACGACAGAAGCTTCATGATGATGG 
QEKKMVESFEKLRQKLHDDG 

370 390 410 

ATTAATGAAAGCAAATGAAACATATTTCCTGTTTAAAGCGATTTCAACACTTTCAATTAT 
L M K A-N ETYFLFKAISTLSIM 

430 450 470 

GGCATTTGCATTTTATCTTCAGTATCTTGGATGGTATATTACTTCTGCATGTTTATTAGC 
AFAFYLQYLGWYI TSACLLA 

490 510 530 

ACTTGCATGGCAACAATTCGGATGGTTAACACATGAGTTCTGCCATCAACAGCCAACAAA 
LAWQQFGWLTHEFCHQQPTK 
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550 570 590 

GAACAGACCTTTGAATGATACTATTTCTTTGTTCTTTGGTAATTTCTTACAAGGATTTTC 
NRPLNDTI SLFFGNFLQGFS 

610 630 650 

AAG AG ATTGGTGG AAGG AC AAGC AT AAC ACT CAT C AC GCTGC C AC AAATG TAATTGATC A 
RDWWKDKHNTHHAATNV IDH 

670 690 710 

TGACGGTGATATCGACTTGGCACCACTTTTCGCATTTATTCCAGGAGATTTGTGCAAGTA 
DGD IDLAPLFAF IPGDLCKY 

730 750 770 

TAAGGCCAGCTTTGAAAAAGCAATTCTCAAGATTGTACCATATCAACATCTCTATTTCAC 
KAS F EKAI LKIVPYQHLYFT 

790 810 830 

CGCAATGCTTCCAATGCTCCGTTTCTCATGGACTGGTCAGTCAGTTCAATGGGTATTCAA 
AMLPMLRFSWTGQSVQWVFK 

850 870 890 

AGaGAATCAAATGGAGTACAAGGTCTATCAAAGAAATGCATTCTGGGAGCAAGCAACAAT 
ENQM.EYKVYQRNAFWEQATI 

910 930 950 

TGTTGGACATTGGGCTTGGGTATTCTATCAATTGTTCTTATTACCAACATGGCCACTTCG 
VGHWAWVFYQLFLLPTWPLR 

970 990 1010 

GGTTGCTTATTTCATTATTTCACAAATGGGAGGAGGCCTTTTGATTGCTCACGTAGTCAC 
VAYFIISQMGGGLLIAHVVT 

FIG. 1 CONT'D 
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1030 1050 1070 

TTTCAACCATAACTCTGTTGATAAGTATCCAGCCAATTCTCGAATTTTAAACAACTTCGC 
FNHNSVDKYPANS RILNNFA 

1090 1110 1130 

CGCTCTTCAAATTTTGACCACACGCAACATGACTCCATCTCCATTCATTGATTGGCTTTG 
ALQILTTRNMTPS PFIDWLW 

1150 1170 1190 

• ••»., 
GGGTGGACTCAATTATCAGATCGAGCACCACTTGTTCCCAACAATGCCACGTTGCAATCT 

GGLNYQIEHHLFPTMPRCNL 

1210 1230 1250 

GAATGCTTGCGTGAAATATGTGAAAGAATGGTGCAAAGAGAATAATCTTCCTTACCTCGT 
NACVKYVKEWCKENNLPYLV 

1270 1290 1310 

CGATGACTACTTTGACGGATATGCAATGAATTTGCAACAATTGAAAAATATGGCTGAGCA 
DDYFDGYAMNLQQLKNMAEH 

1330 1350 1370 

• • • • • • 
CATTCAAGCTAAAGCTGCCTAAACAATCTGGGTGTTCAAAAAGTTTTTTCTTGTTTTTTT 

I Q A K A A * 

1390 1410 1430 

AAATTTAATTCTTTGAAATTATTTGTTTTCCGTCATTCTTCCTCCATTCCCTTTTCTGGT 
1450 

AGAAATAAAACCTTGTTTTTCAA 

FIG. 1 CONT'D 
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RESULTS OF PBM SEARCH USING THE wileynbs LIBRARY 

Run=DOM10004 Scan=738 (Sub) 100%=413600 ADC Mass Range=40-456 
23 Sep 97 3:50 Compacted SLRP +EI 1UL C.E O/N INDUCTION + LA 







RpI 


Mol. 




Serial 


^Oll 1 1 ) 


^Oul 1 IC/ 


Wt. 


Formula A Name 


77275 


99 


81 


292 


C19 H32 02 6, 9, 12-Octadecatrienoic acid, methyl ester 


81040 


95 


74 


122 


C9 H1 4 1 , 4-Cyclononadiene 


43157 


95 


74 


292 


C19 H32 02 6, 9, 12-Octadecatrienoic acid, methyl ester 


77274 


93 


68 


292 


C19 H32 02 6, 9, 12-Octadecatrienoic acid, methyl ester 


6892 


60 


35 


136 


C10H16 .BETA.-FENCHENE 


4278 


59 


33 


122 


C9H14 3-Nonen-1-yne, (z)- 


25116 


55 


29 


206 


C15 H26 5-Pentadecen-7-yne, (z)- 


17742 


55 


29 


178 


C13 H22 3-Tridecen-l-yne, (z)- 


13423 


55 


29 


162 


C12 H18 1,4, 8-Dodecatriene, (E, E. E)- 


10169 


34 


11 


150 


C11 H18 Cyclopropane, 1-etheny-2-hexenyl-, 1. alpha., 2. beta. (E) 


2366 


33 


10 


108 


C8 H12 1 , 4-Cyclooctadiene, (z, z)- 


2372 


32 


9 


108 


C8H12 Bicyclo 5.1.0. oct-3-ene 


6909 


24 


6 


136 


C 1 0 H 1 6 Cyclooctene , 3-etheny I- 


10171 


23 


6 


150 


C11 H18 Cyclohexene, 3-(3-methyl-1-butenyl)-, (E)- 


29046 


23 


6 


222 


C15 H26 0 5, 10-Pentadecadienal, (z, z)- 


17743 


21 


5 


178 


C13H22 3-Tridecen-1-yne, (E)- 


10173 


20 


4 


150 


C11 H18 Spiro 5.5.undec-1-ene 


4281 


20 


4 


122 


C9 H14 Bicyclo 5.1 .0. octane, 8-methylene- 


10192 


18 


3 


150 


C11H18 (-)-2-METHYL~2-BORNENE 


4291 


18 


3 


122 


C9 H14 1 , 2-CYCLONONADIENE 


2597 


18 


3 


110 


C8 H14 Cydopentane, (1-methylethylidene)- 


7335 


12 


2 


138 


C10H18 4-Decyne 


67891 


11 


2 


82 


C6 H10 Cyclopropane, 1 , 2-dimethyl-3-methylene- f cis- 


618 


11 


2 


82 


C6 H10 Cyclopropane, 1, 2-dimethyl-3-methylene-, trans- 


71289 


4 


1 


136 


C10H16 2-.BETA.-PINENE 
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250 
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wileynbs: 77275 Rel(sim):99 Rel(same):81 
6.9. 12-Octadecatrienoic acid, methyl ester 
C19 H32 02 MW=292.240230 CAS=2676417 
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1 PRODUCTION OF GAMMA LINOLENIC ACID BY A A6-DESATORASE 

Linoleic acid (18:2) (LA) is transformed 
into gamma linolenic acid (18:3) (GLA) by the enzyme 
5 A6-desaturase . When this enzyme, or the nucleic acid 
encoding it, is transferred into LA-producing cells, 
GLA is produced. The present invention provides 
nucleic acids comprising the A6-desaturase gene. More 
specifically, the nucleic acids comprise the 
10 promoters, coding regions and termination regions of 
the A6-desaturase genes. The present invention is 
further directed to recombinant constructions 
comprising a A6-desaturase coding region in functional 
combination with heterologous regulatory sequences. 
15 The nucleic acids and recombinant constructions of the 
instant invention are useful in the production of GLA 
in transgenic organisms. 

Unsaturated fatty acids such as linoleic 
(C lB &* 12 ) and a-linolenic (C 19 A 9 12 1S ) acids are essential 
20 dietary constituents that cannot be synthesized by 
vertebrates since vertebrate cells can introduce 
double bonds at the A 9 position of fatty acids but 
cannot introduce additional double bonds between the 
A 9 double bond and the methyl -terminus of the fatty 
25 acid chain. Because they are precursors of other 

products, linoleic and o?-linolenic acids are essential 
fatty acids, and are usually obtained from plant 
sources. Linoleic acid can be converted by mammals 
into 7-linolenic acid (GLA, C :? A € - 912 ) which can in turn 
30 be converted to arachidonic acid (20:4), a critically 
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important fatty acid since it is an essential 
precursor of most prostaglandins. 

The dietary provision of linoleic acid, by 
virtue of its resulting conversion to GLA and 
arachidonic acid, satisfies the dietary need for GLA 
and arachidonic acid. However, a relationship has 
been demonstrated between consumption of saturated 
fats and health risks such as hypercholesterolemia, 
atherosclerosis and other clinical disorders which 
correlate with susceptibility to coronary disease, 
while the consumption of unsaturated fats has been 
associated with decreased blood cholesterol 
concentration and reduced risk of atherosclerosis. 
The therapeutic benefits of dietary GLA may result 
15 from GLA being a precursor to arachidonic acid and 
thus subsequently contributing to prostaglandin 
synthesis. Accordingly, consumption of the more 
unsaturated GLA, rather than linoleic acid, has 
potential health benefits. However, GLA is not 
20 present in virtually any commercially grown crop 
plant . 

Linoleic acid is converted into GLA by the 
enzyme A6 -desaturase . A6 -desaturase, an enzyme of 
more than 350 amino acids, has a membrane -bound domain 
25 and an active site for desaturation of fatty acids. 
When this enzyme is transferred into cells which 
endogenously produce linoleic acid but not GLA, GLA is 
produced. The present invention, by providing the 
gene encoding -desaturase , allows the production of 
transgenic organisms which contain functional A6- 
desaturase and which produce GLA. In addition to 
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1 allowing production of large amounts of GLA, the 

present invention provides new dietary sources of GLA. 

The present invention is directed to 
isolated A6-desaturase genes. Specifically, the 
5 isolated genes comprises the A6-desaturase promoters, 
coding regions, and termination regions. 

The present invention is further directed to 
expression vectors comprising the ^6-desaturase 
promoter, coding region and termination region. 
10 Yet another aspect of this invention is 

directed to expression vectors comprising a *6- 
desaturase coding region in functional combination 
with heterologous regulatory regions, i.e. elements 
not derived from the *6-desaturase gene. 

Cells and organisms comprising the vectors 
of the present invention, and progeny of such 
organisms, are also provided by the present invention. 

A further aspect of the present invention 
provides isolated bacterial ^6-desaturase . An 
isolated plant A6-desaturase is also provided. 

Yet another aspect of this invention 
provides a method for producing plants with increased 
gamma linolenic acid content. 

A method for producing chilling tolerant 
plants is also provided by the present invention. 

Fig. 1 depicts the hydropathy profiles of 
the deduced amino acid sequences of S vnechoc vs t i s 
desaturase (Panel A) and Al2-desaturase (Panel B) . 
Putative membrane spanning regions are indicated by 
solid bars. Hydrophobic index was calculated for a 
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1 window size of 19 amino acid residues [Kyte, et al . 
(1982) J. Molec. Biol. 1.57] . 

Fig. 2 provides gas liquid chromatography 
profiles of wild type (Panel A) and transgenic (Panel 

5 B) Anabaena ♦ 

Fig. 3 is a diagram of maps of cosmid cSy75, 
cSyl3 and Csy7 with overlapping regions and subclones. 
The origins of subclones of Csy75, Csy75-3.5 and Csy7 
are indicated by the dashed diagonal lines. 
10 Restriction sites that have been inactivated are in 

parentheses . 

Fig. 4 provides gas liquid chromatography 
profiles of wild type (Panel A) and transgenic (Panel 

B) tobacco. 

Fig. 5A depicts the DNA sequence of a A- 6 
desaturase cDNA isolated from borage. 

Fig. 5B depicts the protein sequence of the 
open reading frame in the isolated borage A- 6 
desaturase cDNA. Three amino acid motifs 
characteristic of desaturases are indicated and are, 
in order, lipid box, metal box 1, and metal box 2. 

Fig. 6 is a dendrogram showing similarity of 
the borage A6 -desaturase to other membrane - bound 
desaturases. The amino acid sequence of the borage 
A6- desaturase was compared to other known desaturases 
using Gene Works (IntelliGenetics) . Numerical values 
correlate to relative phylogenetic distances between 

subgroups compared. 

Fig. 7 is a restriction map of 22l.A6.NOS 
30 and 121.A6.NOS. In 221.A6.NOS, the remaining portion 
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1 of the plasmid is pBI221 and in 121.A6.NOS, the 
remaining portion of the plasmid is pBI121. 

Fig. 8 provides gas liquid chromatography 
profiles of mock transfected (Panel A) and 221.A6.NOS 
5 transfected (Panel B) carrot cells. The positions of 
18:2, 18:3 or, and 18:3 y(GLA) are indicated. 

Fig. 9 provides gas liquid chromatography 
profiles of an untransf ormed tobacco leaf (Panel A) 
and a tobacco leaf transformed with 121.A6.NOS. The 
10 positions of 18:2, 18:3 a, 18:3y(GLA), and 18:4 are 
indicated . 

Fig. 10 provides gas liquid chromotography 
profiles for untransf ormed tobacco seeds (Panel A) and 
seeds of tobacco transformed with 121.A6.NOS. The 

15 positions of 18:2, 18:3a and 18:3-y(GLA) are indicated. 

The present invention provides isolated 
nucleic acids encoding A6-desaturase . To identify a 
nucleic acid encoding A6-desaturase, DNA is isolated 
from an organism which produces GLA. Said organism 

20 can be, for example, an animal cell, certain fungi 
(e.g. Mortierella ) , certain bacteria (e.g. 
Svnechocvstis ) or certain plants (borage, Oenothera, 
currants) . The isolation of genomic DNA can be 
accomplished by a variety of methods well-known to one 

25 of ordinary skill in the art, as exemplified by 
Sambrook et al. (1989) in Molecular Cloning: A 
Laboratory Manual , Cold Spring Harbor, NY. The 
isolated DNA is fragmented by physical methods or 
enzymatic digestion and cloned into an appropriate 

30 vector, e.g. a bacteriophage or cosmid vector, by any 
of a variety of well-known methods which can be found 
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1 in references such as Sambrook et al. (1989) . 

Expression vectors containing the DNA of the present 
invention are specifically contemplated herein. DNA 
encoding ^6-desaturase can be identified by gain of 
5 function analysis. The vector containing fragmented 
DNA is transferred, for example by infection, 
transconjugation, transf ection, into a host organism 
that produces linoleic acid but not GLA. As used 
herein, "transformation" refers generally to the 

10 incorporation of foreign DNA into a host cell. 

Methods for introducing recombinant DNA into a host 
organism are known to one of ordinary skill in the art 
and can be found, for example, in Sambrook et al^ 
(1989). Production of GLA by these organisms (i.e., 

15 gain of function) is assayed, for example by gas 
chromatography or other methods known to the 
ordinarily skilled artisan. Organisms which are 
induced to produce GLA, i.e. have gained function by 
the introduction of the vector, are identified as 

20 expressing DNA encoding A6-desaturase, and said DNA is 
recovered from the organisms. The recovered DNA can 
again be fragmented, cloned with expression vectors, 
and functionally assessed by the above procedures to 
define with more particularity the DNA encoding a6- 

25 desaturase . 

As an example of the present invention, 
random DNA is isolated from the cyanobacteria 
R ynechocvstis Pasteur Culture Collection (PCC) 6803, 
American Type Culture Collection (ATCC) 27184, cloned 

30 into a cosmid vector, and introduced by 

transconjugation into the GLA-def icient cyanobacterium 
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1 Anabaena strain PCC 7120, ATCC 27893. Production of 
GLA from Anabaena linoleic acid is monitored by gas 
chromatography and the corresponding DNA fragment is 
isolated. 

5 The isolated DNA is sequenced by methods 

well-known to one of ordinary skill in the art as 
found, for example, in Sambrook et al. (1989) . 

In accordance with the present invention, 
DNA molecules comprising A6-desaturase genes have been 
10 isolated. More particularly, a 3.588 kilobase (kb) 

DNA comprising a A6-desaturase gene has been isolated 
from the cyanobacteria Svnechocvst is . The nucleotide 
sequence of the 3.588 kb DNA was determined and is 
shown in SEQ ID NO:l. Open reading frames defining 
15 potential coding regions are present from nucleotide 
317 to 1507 and from nucleotide 2002 to 3081. To 
define the nucleotides responsible for encoding a6- 
desaturase, the 3.588 kb fragment that confers *6- 
desaturase activity is cleaved into two subf ragments, 
20 each of which contains only one open reading frame. 
Fragment 0RF1 contains nucleotides 1 through 1704, 
while fragment 0RF2 contains nucleotides 1705 through 
3588. Each fragment is subcloned in both forward and 
reverse orientations into a conjugal expression vector 
25 (AM542, Wolk et al. [1984] Proc . Natl. Acad. Sci . USA 

81, 1561) that contains a cyanobacterial carboxylase 
promoter. The resulting constructs (i.e. ORFl (F) , 
ORFl(R), ORF2(F) and ORF2 (R) ] are conjugated to wild- 
type Anabaena PCC 7120 by standard methods (see, for 

20 example, Wolk et al . (1984) Proc. Natl. Acad. Sci. USA 

82, 1561) . Conjugated cells of Anabaena are 



35 



SUBSTITUTE SHEET (RULE 26) 



WO 96/21022 




PCT/IB95/01 167 



-8- 



1 identified as Neo* green colonies on a brown 

background of dying non-conjugated cells after two 
weeks of growth on selective media (standard mineral 
media BG11N + containing 3 0jig/ml of neomycin according 
5 to Rippka et al., (1979) J. Gen Microbiol, ill, 1). 
The green colonies are selected and grown in selective 
liquid media (BG11N + with 15Mg/ml neomycin) . Lipids 
are extracted by standard methods (e.g. Dahmer et al., 
(1989) .Tnurnal of American Oil Chemical Society 66, 
10 54 3) from the resulting transconjugants containing the 
forward and reverse oriented ORF1 and ORF2 constructs. 
For comparison, lipids are also extracted from wild- 
type cultures of Anabaena and Synechocvstis . The 
fatty acid methyl esters are analyzed by gas liquid 
15 chromatography (GLC) , for example with a Tracor-560 
gas liquid chromatograph equipped with a hydrogen 
flame ionization detector and a capillary column. The 
results of GLC analysis are shown in Table 1. 
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1 Table 1: Occurrence of C18 fatty acids in wild-type 
311(1 transgenic cyanobacteria 



5 



10 



SOURCE 


18:0 


18:1 


18 : 2 


VIA • 


1 /v~l Q • O 

j uio : j 


1 O . A 1 

18:4 


Anabaena 
(wild type) 


+ 


+ 


+ 




+ 




Anabaena + ORF1 (F) 


+ 


I + 


+ 








Anabaena + ORF1 (R) 


+ 


+ 


+ 




+ 




Anabaena + 0RF2(F) 


+ 


+ 


+ 


+ 


+ 




Anabaena + 0RF2(R) 


+ 


+ 


+ 




+ 




Synechocystis 
j (wild type) 


+ 


+ 


+ 


+ 







As assessed by GLC analysis, GLA deficient 
15 Anabaena gain the function of GLA production when the 
construct containing ORF2 in forward orientation is 
introduced by transconjugation . Transconjugants 
containing constructs with ORF2 in reverse orientatior 
to the carboxylase promoter, or ORFi in either 
20 orientation < show no GLA production. This analysis 

demonstrates that the single open reading frame (0RF2) 
within the 1884 bp fragment encodes 46-desaturase. 
The 1884 bp fragment is shown as SEQ ID NO: 3. This is 
substantiated by the overall similarity of the 
25 hydropathy profiles between A6-desaturase and a12- 

desaturase fWada et al. (1990) Nature 347] as shown in 
Fig. 1 as <A) and (B) , respectively. 

Also in accordance with the present 
invention, a cDNA comprising a A6-desaturase gene from 
30 bora 9 e (Boraqo officinalis) has been isolated. The 
nucleotide sequence of the 1.685 kilobase (kb) cDNA 
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10 



was determined and is shown in Fig. 5A (SEQ ID NO: 4) . 
The ATG start codon and stop codon are underlined. 
The amino acid sequence corresponding to the open 
reading frame in the borage delta 6-desaturase is 
shown in Fig. 5B (SEQ ID NO: 5) . 

Isolated nucleic acids encoding a6- 
desaturase can be identified from other GLA-producing 
organisms by the gain of function analysis described 
above, or by nucleic acid hybridization techniques 
using the isolated nucleic acid which encodes 
Synechocvstis or borage c.6 -desaturase as a 
hybridization probe. Both genomic and cDNA cloning 
methods are known to the skilled artisan and are 
contemplated by the present invention. The 
15 hybridization probe can comprise the entire DNA 

sequence disclosed as SEQ. ID NO:l or SEQ. ID NO: 4, or 
a restriction fragment or other DNA fragment thereof, 
including an oligonucleotide probe. Methods for 
cloning homologous genes by cross-hybridization are 
known to the ordinarily skilled artisan and can be 
found, for example, in Sambrook (1989) and Beltz et 
al . (1983) Methods in Enzvmolocrv 100 , 266. 

In another method of identifying a delta 6- 
desaturase gene from an organism producing GLA, a cDNA 
25 library is made from poly-A* RNA isolated from 

polysomal RNA. In order to eliminate hyper -abundant 
expressed genes from the cDNA population, cDNAs or 
fragments thereof corresponding to hyper -abundant 
cDNAs genes are used as hybridization probes to the 
cDNA library. Non hybridizing plaques are excised and 
the resulting bacterial colonies are used to inoculate 
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1 liquid cultures and sequenced. For example, as a 

means of eliminating other seed storage protein cDNAs 
from a cDNA library made from borage polysomal RNA, 
cDNAs corresponding to abundantly expressed seed 
5 storage proteins are first hybridized to the cDNA 

library. The "subtracted" DNA library is then used to 
generate expressed sequence tags (ETSs) and such tags 
are used to scan a data base such as GenBank to 
identify potential desaturates. 
10 Transgenic organisms which gain the function 

of GLA production by introduction of DNA encoding 
desaturase also gain the function of 
octadecatetraeonic acid (18 :4* 6 9 ' 1215 ) production. 
Octadecatetraeonic acid is present normally in fish 
15 oils and in some plant species of the Boraainaceae 
family (Craig et al. [1964] J. Amer. Oil Chem. Soc. 
41, 209-211; Gross et al. [1976] Can. J. Plant Sci . 
56, 659-664) . In the transgenic organisms of the 
present invention, octadecatetraenoic acid results 
20 from further desaturation of a-linolenic acid by a6- 
desaturase or desaturation of GLA by ^15-desaturase . 

The 359 amino acids encoded by ORF2, i.e. 
the open reading frame encoding S vnechocvs t i s a6- 
desaturase, are shown as SEQ. ID NO: 2. The open 
25 reading frame encoding the borage A6-desaturase is 

shown in SEQ ID NO: 5. The present invention further 
contemplates other nucleotide sequences which encode 
the amino acids of SEQ ID NO: 2 and SEQ ID NO: 5. It 
is within the ken of the ordinarily skilled artisan to 
20 identify such sequences which result, for example, 

from the degeneracy of the genetic code. Furthermore, 
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one of ordinary skill in the art can determine, by the 
gain of function analysis described hereinabove, 
smaller subfragments of the fragments containing the 
open reading frames which encode ^6-desaturases . 

The present invention contemplates any such 
polypeptide fragment of A6-desaturase and the nucleic 
acids therefor which retain activity for converting LA 
to GLA. 

In another aspect of the present invention, 
a vector containing a nucleic acid of the present 
invention or a smaller fragment containing the 
promoter, coding sequence and termination region of a 
A6-desaturase gene is transferred into an organism, 
for example, cyanobacteria , in which the A6 -desaturase 
!5 promoter and termination regions are functional . 
Accordingly, organisms producing recombinant A6- 
desaturase are provided by this invention. Yet 
another aspect of this invention provides isolated A6- 
desaturase, which can be purified from the recombinant 
organisms by standard methods of protein purification. 
(For example, see Ausubel et al. [1987] Current 
Protocols in Molecular Biology , Green Publishing 
Associates, New York) . 

Vectors containing DNA encoding a6- 
desaturase are also provided by the present invention. 
It will be apparent to one of ordinary skill in the 
art that~appropriate vectors can be constructed to 
direct the expression of the A6 -desaturase coding 
sequence in a variety of organisms. Replicable 
expression vectors are particularly preferred. 
Replicable expression vectors as described herein are 



20 



25 



30 



35 



SUBSTITUTE SHEET (RULE 26) 



WO 96/21022 




PCT/IB95/01167 



1 DNA or RNA molecules engineered for controlled 

expression of a desired gene, i.e. the *6-desaturase 
gene. Preferably the vectors are plasmids, 
bacteriophages, cosmids or viruses. Shuttle vectors, 
5 e.g. as described by Wolk et al . (1984) Proc. Natl. 
Acad. Sci. USA , 1561-1565 and Bustos et al. (1991) J. 
Bacteriol . 174 , 7525-7533, are also contemplated in 
accordance with the present invention. Sambrook et 
al. (1989), Goeddel, ed. (1990) Methods in Enzvmolocrv 

10 185 Academic Press, and Perbal (1988) A Practical 

Guide to Molecular Cloning , John Wiley and Sons, Inc., 
provide detailed reviews of vectors into which a 
nucleic acid encoding the present ^6-desaturase can be 
inserted and expressed. Such vectors also contain 

15 nucleic acid sequences which can effect expression of 
nucleic acids encoding ^6-desaturase . Sequence 
elements capable of effecting expression of a gene 
product include promoters, enhancer elements, upstream 
activating sequences, transcription termination 

20 signals and polyadenylation sites. Both constitutive 
and tissue specific promoters are contemplated. For 
transformation of plant cells, the cauliflower mosaic 
virus (CaMV) 35S promoter and promoters which are 
regulated during plant seed maturation are of 

25 particular interest. All such promoter and 

transcriptional regulatory elements, singly or in 
combination, are contemplated for use in the present 
replicable expression vectors and are known to one of 
ordinary skill in the art. The CaMV 355 promoter is 

^ n described, for example, by Restrepo et al. (1990) 
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1 Plant Cell 2, 987. Genetically engineered and mutated 
regulatory sequences are also contemplated. 

The ordinarily skilled artisan can determine 
vectors and regulatory elements suitable for 
5 expression in a particular host cell. For example, a 
vector comprising the promoter from the gene encoding 
the carboxylase of Anabaena operably linked to the 
coding region of A6 -desaturase and further operably 
linked to a termination signal from Svnechocvs t i s is 

10 appropriate for expression of A6 -desaturase in 

cyanobacteria . "Operably linked" in this context 
means that the promoter and terminator sequences 
effectively function to regulate transcription. As a 
further example, a vector appropriate for expression 

15 of A6 -desaturase in transgenic plants can comprise a 
seed- specif ic promoter sequence derived from 
helianthinin, napin, or glycinin operably linked to 
the A6 -desaturase coding region and further operably 
linked to a seed termination signal or the nopaline 

20 synthase termination signal. As a still further 
example, a vector for use in expression of A 6- 
desaturase in plants can comprise a constitutive 
promoter or a tissue specific promoter operably linked 
to the A 6- desaturase coding region and further 

25 operably linked to a constitutive or tissue specific 
terminator or the nopaline synthase termination 
signal . 

In particular, the helianthinin regulatory 
elements disclosed in applicant's copending U.S. 
20 Application Serial No. 682,354, filed April 8, 1991 

and incorporated herein by reference, are contemplated 
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1 as promoter elements to direct the expression of the 
A6-desaturase of the present invention. 

Modifications of the nucleotide sequences or 
regulatory elements disclosed herein which maintain 
5 the functions contemplated herein are within the scope 
of this invention. Such modifications include 
insertions, substitutions and deletions, and 
specifically substitutions which reflect the 
degeneracy of the genetic code. 

10 Standard techniques for the construction of 

such hybrid vectors are well-known to those of 
ordinary skill in the art and can be found in 
references such as Sambrook et al. (1989) , or any of 
the myriad of laboratory manuals on recombinant DNA 

15 technology that are widely available. A variety of 
strategies are available for ligating fragments of 
DNA, the choice of which depends on the nature of the 
termini of the DNA fragments. It is further 
contemplated in accordance with the present invention 

20 to include in the hybrid vectors other nucleotide 

sequence elements which facilitate cloning, expression 
or processing, for example sequences encoding signal 
peptides, a sequence encoding KDEL, which is required 
for retention of proteins in the endoplasmic reticulum 

25 or sequences encoding transit peptides which direct 
^6-desaturase to the chloroplast. Such sequences are 
known to one of ordinary skill in the art. An 
optimized transit peptide is described, for example, 
by Van den Broeck et al. (1985) Nature 313 , 358. 

~ n Prokaryotic and eukaryotic signal sequences are 
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1 disclosed, for example, by Michaelis et al. (1982) 
Ann, Rev. Microbiol. 26, 425. 

A further aspect of the instant invention 
provides organisms other than cyanobacteria or plants 
5 which contain the DNA encoding the a 6 -desaturase of 
the present invention. The transgenic organisms 
contemplated in accordance with the present invention 
include bacteria, cyanobacteria, fungi, and plants and 
animals. The isolated DNA of the present invention 

20 can be introduced into the host by methods known in 
the art, for example infection, transf ection, 
transformation or transcon jugation . Techniques for 
transferring the DNA of the present invention into 
such organisms are widely known and provided in 
references such as Sambrook et al. (1989) . 

A variety of plant transformation methods 
are known. The a6 -desaturase gene can be introduced 
into plants by a leaf disk transf ormat ion- regeneration 
procedure as described by Horsch et al. (1985) Science 

20 227 , 1229. Other methods of transformation, such as 
protoplast culture (Horsch et al. (1984) Science 223 , 
496; DeBlock et al. (1984) EMBO J. 2, 2143; Barton et 
al. (1983) Cell 32, 1033) can also be used and are 
within the scope of this invention. In a preferred 

2^ embodiment plants are transformed with Agrobacterium - 
derived vectors. However, other methods are available 
to insert the ^6-desaturase genes of the present 
invention into plant cells. Such alternative methods 
include biolistic approaches (Klein et al. (1987) 
Nature 327 , 70) , electroporation, chemically- induced 
DNA uptake, and use of viruses or pollen as vectors. 
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1 When necessary for the transformation 

method, the *6-desaturase genes of the present 
invention can be inserted into a plant transformation 
vector, e.g. the binary vector described by Bevan 
5 (1984) Nucleic Acids Res. 12 . 8111. Plant 

transformation vectors can be derived by modifying the 
natural gene transfer system of Aarobacterium 
tumef aciens . The natural system comprises large Ti 
(tumor -inducing) -plasmids containing a large segment, 
10 known as T-DNA, which is transferred to transformed 
plants. Another segment of the Ti plasmid, the vir 
region, is responsible for T-DNA transfer. The T-DNA 
region is bordered by terminal repeats. In the 
modified binary vectors the tumor- inducing genes have 
15 been deleted and the functions of the vir region are 
utilized to transfer foreign DNA bordered by the T-DNA 
border sequences. The T-region also contains a 
selectable marker for antibiotic resistance, and a 
multiple cloning site for inserting sequences for 
20 transfer. Such engineered strains are known as 
"disarmed" A. tumef aciens strains, and allow the 
efficient transformation of sequences bordered by the 
T-region into the nuclear genomes of plants. 

Surface-sterilized leaf disks are inoculated 
25 with the "disarmed" foreign DNA-containing A. 
tumef aciens , cultured for two days, and then 
transferred to antibiotic-containing medium. 
Transformed shoots are selected after rooting in 
medium containing the appropriate antibiotic, 
20 transferred to soil and regenerated. 
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1 Another aspect of the present invention 

provides transgenic plants or progeny of these plants 
containing the isolated DNA of the invention. Both 
monocotyledenous and dicotyledenous plants are 
5 contemplated. Plant cells are transformed with the 
isolated DNA encoding a6 -desaturase by any of the 
plant transformation methods described above. The 
transformed plant cell, usually in a callus culture or 
leaf disk, is regenerated into a complete transgenic 

10 plant by methods well-known to one of ordinary skill 
in the art (e.g. Horsch et al. (1985) Science 227, 
112 9) . In a preferred embodiment, the transgenic 
plant is sunflower, oil seed rape, maize, tobacco, 
peanut or soybean. Since progeny of transformed 

15 plants inherit the DNA encoding a6 -desaturase , seeds 
or cuttings from transformed plants are used to 
maintain the transgenic plant line. 

The present invention further provides a 
method for providing transgenic plants with an 

20 increased content of GLA. This method includes 

introducing DNA encoding A6 -desaturase into plant 
cells which lack or have low levels of GLA but contain 
LA, and regenerating plants with increased GLA content 
from the transgenic cells. In particular, 

25 commercially grown crop plants are contemplated as the 
transgenic organism, including, but not limited to, 
sunflower, soybean, oil seed rape, maize, peanut and 
tobacco. 

The present invention further provides a 
method for providing transgenic organisms which 
contain GLA. This method comprises introducing DNA 
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1 encoding *6 -desaturase into an organism which lacks or 
has low levels of GLA, but contains LA. In another 
embodiment, the method comprises introducing one or 
more expression vectors which comprise DNA encoding 
5 ^12 -desaturase and *6 -desaturase into organisms which 
are deficient in both GLA and LA. Accordingly, 
organisms deficient in both LA and GLA are induced to 
produce LA by the expression of a12 -desaturase, and 
GLA is then generated due to the expression of *6- 

10 desaturase. Expression vectors comprising DNA 

encoding £l2-desaturase, or a12 -desaturase and a6- 
desaturase, can be constructed by methods of 
recombinant technology known to one of ordinary skill 
in the art (Sambrook et al . , 1989) and the published 

15 sequence of al2-desaturase (Wada et al [1990] Nature 
(London) 347 , 200-203. In addition, it has been 
discovered in accordance with the present invention 
that nucleotides 2002-3081 of SEQ. ID NO:l encode 
cyanobacterial ^12-desaturase . Accordingly, this 

20 sequence can be used to construct the subject 

expression vectors. In particular, commercially grown 
crop plants are contemplated as the transgenic 
organism, including, but not limited to, sunflower, 
soybean, oil seed rape, maize, peanut and tobacco. 

25 The present invention is further directed to 

a method of inducing chilling tolerance in plants. 
Chilling sensitivity may be due to phase transition of 
lipids in cell membranes. Phase transition 
temperature depends upon the degree of unsaturation of 

20 fatty acids in membrane lipids, and thus increasing 

the degree of unsaturation, for example by introducing 
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1 ^6-desaturase to convert LA to GLA, can induce or 

improve chilling resistance. Accordingly, the present 
method comprises introducing DNA encoding a6- 
desaturase into a plant cell, and regenerating a plant 

5 with improved chilling resistance from said 

transformed plant cell. In a preferred embodiment, 
the plant is a sunflower, soybean, oil seed rape, 
maize, peanut or tobacco plant. 

The following examples further illustrate 
10 the present invention. 
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3_ EXAMPLE 1 

Strains and Culture Conditions 

Svnechocvstis (PCC 6803, ATCC 27184), 
5 Anabaena (PCC 7120, ATCC 27893) and Synechococcus (PCC 
7942, ATCC 33912) were grown photoautotrophically at 
30°C in BG11N+ medium (Rippka et al. [1979] J . Gen . 
Microbiol . Ill . 1-61) under illumination of 
incandescent lamps 

(60^E .m' 2 . S' 1 ) . Cosmids and plasmids were selected and 
propagated in Escherichia coli strain DH5o on LB 
medium supplemented with antibiotics at standard 
concentrations as described by Maniatis et al. (1982) 
Molecular Cloning: A Laboratory Manual . Cold Spring 
25 Harbor Laboratory, Cold Spring, New York. 
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1 EXAMPLE 2 

Construction of Synechocvstis Cosmic! Genomic Library 

Total genomic DNA from Synechocvstis (PCC 
5 6803) was partially digested with Sau3A and 

fractionated on a sucrose gradient (Ausubel et al . 
[1987] Current Protocols in Molecular Biology , Greene 
Publishing Associates and Wiley Interscience, New 
York) . Fractions containing 30 to 40 kb DNA fragments 

10 were selected and ligated into the dephosphorylated 
Bam HI site of the cosmid vector, pDUCA7 (Buikema et 
al . [1991] J. Bacterid . 173 , 1879-1885) . The ligated 
DNA was packaged in vitro as described by Ausubel et 
al. (1987) , and packaged phage were propagated in 

15 coli DH5cv containing the Ava l and Eco4711 methylase 
helper plasmid, pRL528 as described by Buikema et al. 
(1991) . A total of 1152 colonies were isolated 
randomly and maintained individually in twelve 96 -well 
microtiter plates. 
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1 EXAMPLE 3 

Gain- of -Function Expression of GLA in Anabaena 

Anabaena (PCC 7120) , a filamentous 
5 cyanobacterium, is deficient in GLA but contains 
significant amounts of linoleic acid, the precursor 
for GLA (Figure 2; Table 2) . The Svnechocvstis cosmid 
library described in Example 2 was conjugated into 
Anabaena (PCC 7120) to identify transconjugants that 

10 produce GLA. Anabaena cells were grown to mid- log 
phase in BG11N+ liquid medium and resuspended in the 
same medium to a final concentration of approximately 
2x10- cells per ml. A mid-log phase culture of E . 
coli RP4 (Burkardt et al. [1979] J. Gen. Microbiol. 

15 114 , 341-348) grown in LB containing ampicillin was 
washed and resuspended in fresh LB medium. Anabaena 
and RP4 were then mixed and spread evenly on BG11N+ 
plates containing 5% LB. The cosmid genomic library 
was replica plated onto LB plates containing 50 /ig/ml 

20 kanamycin and 17.5 ng/ml chloramphenicol and was 
subsequently patched onto BG11N+ plates containing 
Anabaena and RP4 . After 24 hours of incubation at 
3 0°C, 3 0 pg/ml of neomycin was underlaid; and 
incubation at 30 °C was continued until transconjugants 

25 appeared . 

Individual transconjugants were isolated 
after conjugation and grown in 2 ml BG11N+ liquid 
medium with 15 /xg/ml neomycin. Fatty acid methyl 
esters were prepared from wild type cultures and 
cultures containing pools of ten transconjugants as 
follows. Wild type and transgenic cyanobacterial 
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1 cultures were harvested by centrif ugation and washed 
twice with distilled water. Fatty acid methyl esters 
were extracted from these cultures as described by 
Dahmer et al. (1989) J. Atner. Oil, Chem. Soc. 66, 543- 

5 54 8 and were analyzed by Gas Liquid Chromatography 
(GLC) using a Tracor-560 equipped with a hydrogen 
flame ionization detector and capillary column (30 m x 
0.2 5 mm bonded FSOT Superox II, All tech Associates 
Inc., ID. Retention times and co- chromatography of 
10 standards (obtained from Sigma Chemical Co.) were used 
for identification of fatty acids. The average fatty 
acid composition was determined as the ratio of peak 
area of each C18 fatty acid normalized to an internal 
standard . 

•j^ Representative GLC profiles are shown in 

Fig. 2. C18 fatty acid methyl esters are shown. 
Peaks were identified by comparing the elution times 
with known standards of fatty acid methyl esters and 
were confirmed by gas chromatography-mass 
20 spectrometry. Panel A depicts GLC analysis of fatty 
acids of wild type Anabaena . The arrow indicates the 
migration time of GLA. Panel B is a GLC profile of 
fatty acids of transconjugants of Anabaena with 
pAM54 2+1.8F. Two GLA producing pools (of 25 pools 
25 representing 250 transconjugants) were identified that 
produced GLA. Individual transconjugants of each GLA 
positive pool were analyzed for GLA production; two 
independent transconjugants, AS13 and AS75, one from 
each pool, were identified which expressed significant 
levels of GLA and which contained cosmids, cSyl3 and 
cSy75, respectively (Figure 3). The cosmids overlap 
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1 in a region approximately 7.5 kb in length. A 3.5 kb 
Nhel fragment of cSy75 was recloned in the vector 
pDUCA7 and transferred to Anabaena resulting in gain- 
of- function expression of GLA (Table 2) . 
5 Two Nhe l / Hin d III subfragments (1.8 and 1.7 

kb) of the 3.5 Jcb Nhe I fragment of cSy75-3.5 were 
subcloned into "pBLUESCRIPT" (Stratagene) (Figure 3) 
for sequencing. Standard molecular biology techniques 
were performed as described by Maniatis et al. (1982) 
10 and Ausubel et al. (1987) . Dideoxy sequencing (Sanger 
et al . [1977] Proc. Natl. Acad. Sci . USA 74, 5463- 
5467) of pBSl.8 was performed with "SEQUENASE" (United 
States Biochemical) on both strands by using specific 
oligonucleotide primers synthesized by the Advanced 
15 DNA Technologies Laboratory (Biology Department, Texas 
A & M University) . DNA sequence analysis was done 
with the GCG (Madison, WI) software as described by 
Devereux et al. (1984) Nucleic Acids Res. 12 , 387-395. 
Both Nhe l/ Hin dlll subfragments were 
20 transferred into a conjugal expression vector, AM542, 
in both forward and reverse orientations with respect 
to a cyanobacterial carboxylase promoter and were 
introduced into Anabaena by conjugation. 
Transconjugants containing the 1.8 kb fragment in the 
25 forward orientation (AM542-1.8F) produced significant 
quantities of GLA and octadecatetraenoic acid (Figure 
2; Table 2). Transconjugants containing other 
constructs, either reverse oriented 1.8 kb fragment or 
forward and reverse oriented 1.7 kb fragment, did not 
30 produce detectable levels of GLA (Table 2) . 
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1 Figure 2 compares the C18 fatty acid profile 

of an extract from wild type Anabaena (Figure 2A) with 
that of transgenic Anabaena containing the 1.8 kb 
fragment of cSy75-3.5 in the forward orientation 

5 (Figure 2B) . GLC analysis of fatty acid methyl esters 
from AM542-1.8F revealed a peak with a retention time 
identical to that of authentic GLA standard. Analysis 
of this peak by gas chromatography-mass spectrometry 
(GC-MS) confirmed that it had the same mass 
10 fragmentation pattern as a GLA reference sample. 
Transgenic Anabaena with altered levels of 
polyunsaturated fatty acids were similar to wild type 
in growth rate and morphology. 
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1 Table 2 Composition of C18 Fatty Acids in Wild Type 
and Transgenic Cyanobacteria 



Strain 



Fatty Acid (%) 



18:0 18:1 18:2 18.3(a) 18.3( T ) 18.4 



10 



Wild Type 
Synechocya tis 
(sp.PCC6B03) 



13-6 4.5 54.5 



27.3 



Anabaena 
(sp.PCC7120) 



2.9 24.8 37.1 35.2 



15 



Syn echococcu s 
(sp.PCC7942) 

Anabaena Transcon j ugants 



20.6 79.4 



20 



cSy75 


3.8 


24.4 


22.3 


9.1 


27.9 


12.5 


cSy75-3.5 


4.3 


27.6 


18.1 


3.2 


40.4 


6.4 


pAM542 - 1.8P 


4.2 


13.9 


12.1 


19.1 


25.4 


25.4 


pAM542 - 1.8R 


7.7 


23.1 


38.4 


30.8 






DAM542 - 1.7P 


2.8 


27.8 


36.1 


33.3 






pAM542 - 1.7R 


2.8 


25.4 


42.3 


29.6 







25 



Synechococcus Trans formants 



pAM854 

PAM854 -A" 

pAM854 -A* 

pAM854 -A'&A 12 



27.8 
4.0 
18.2 
42.7 



72.2 

43.2 46.0 
81.8 

25.3 19.5 



16.5 



18:0, stearic acid; 18:1, oleic acid; 18:2, linoleic acid; 
3^ 18:3(or), linolenic acid; 18:3(7), 7-linolenic acid; 18:4, 
octadecatetraenoic acid 



35 



SUBSTITUTE SHEET (RULE 26) 



WO 96/21022 




PCT/IB95/01167 



-28- 



10 



15 



EXAMPLE 4 

Transformation of Synechococcus 
with a6 and a12 Desaturase Genes 

A third cosmid, cSy7, which contains a a12- 
desaturase gene, was isolated by screening the 
p ynpchocvstis genomic library with a oligonucleotide 
synthesized from the published Synechocystis a12- 
desaturase gene sequence (Wada et al. [1990] Nature 
(London) 347 , 200-203) . A 1.7 kb Aval fragment from 
this cosmid containing the a12 -desaturase gene was 
identified and used as a probe to demonstrate that 
cSyl3 not only contains a a6 -desaturase gene but also 
a ^12-desaturase gene (Figure 3) . Genomic Southern 
blot analysis further showed that both the ^6-and a12- 
desaturase genes are unique in the S vne choc vs t i s 
genome so that both functional genes involved in CI 8 
fatty acid desaturation are linked closely in the 
S vnechocvs t i s genome. 

The unicellular cyanobacterium Synechococcus 
(PCC 7942) is deficient in both linoleic acid and 
GLiA ( 3 ) . The a12 and a6- desaturase genes were cloned 
individually and together into pAM854 (Bustos et al . 
[1991] .t. Bacterid. 174 , 7525-7533), a shuttle vector 
that contains sequences necessary for the integration 
2 ^ of foreign DNA into the genome of Synechococcus 

(Golden et al. [1987] Methods in Enzvmol . 153, 215- 
23i) . Synechococcus was transformed with these gene 
constructs and colonies were selected. Fatty acid 
methyl esters were extracted from transgenic 
Synechococcus and analyzed by GLC. 
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1 Table 2 shows that the principal fatty acids 

of wild type Svnechococcus are stearic acid (18:0) and 
oleic acid (18:1). Svnechococcus transformed with 
pAM854-*12 expressed linoleic acid (18:2) in addition 
5 to the principal fatty acids. Trans formants with 
pAM854-*6 and a12 produced both linoleate and GLA 
(Table 1) . These results indicated that Svnechococcus 
containing both a12- and a6-desaturase genes has 
gained the capability of introducing a second double 

10 bond at the *12 position and a third double bond at 
the ^6 position of C18 fatty acids. However, no 
changes in fatty acid composition was observed in the 
transformant containing pAM854-A6, indicating that in 
the absence of substrate synthesized by the a12 

15 desaturase, the A6-desaturase is inactive. This 
experiment further confirms that the 1.8 kb 
Nhel/Hindlll fragment (Figure 3) contains both coding 
and promoter regions of the Svnechocvstis a6- 
desaturase gene. Transgenic Svnechococcus with 

20 altered levels of polyunsaturated fatty acids were 
similar to wild type in growth rate and morphology. 
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1 EXAMPLE 5 

Nucleotide Sequence of i6-Desaturase 

The nucleotide sequence of the 1.8 kb 
5 fragment of cSy75-3.5 including the functional a6- 

desaturase gene was determined. An open reading frame 
encoding a polypeptide of 359 amino acids was 
identified (Figure 4). A Kyte-Doolittle hydropathy 
analysis (Kyte et al. [1982] J. Mol . Biol. 157, 105- 

10 132) identified two regions of hydrophobic amino acids 
that could represent transmembrane domains (Figure 
1A) ; furthermore, the hydropathic profile of the lS- 
desaturase is similar to that of the *i2-desaturase 
gene (Figure IB; Wada et al. ) and *9-desaturases 

15 (Thiede et al. [1986] J Biol. Chem. 261, 13230- 

13235) . However, the sequence similarity between the 
g ynprhocvstis a6 - and *12 -desaturases is less than 40% 
at the nucleotide level and approximately 18% at the 
amino acid level . 

20 
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1 



EXAMPLE 6 



Transfer of Cyanobacterial A 6 -Desaturase into Tobacco 



5 mobilized into a plant expression vector and 

transferred to tobacco using Agrobacterium mediated 
gene transfer techniques. To ensure that the 
transferred desaturase is appropriately expressed in 
leaves and developing seeds and that the desaturase 

10 gene product is targeted to the endoplasmic reticulum 
or the chloroplast, various expression cassettes with 
Svnechocystis ^-desaturase open reading frame (ORF) 
were constructed. Components of these cassettes 
include: (i) a 3 5S promoter or seed specific promoter 

15 derived from the sunflower helianthinin gene to drive 
A 6 -desaturase gene expression in all plant tissues or 
only in developing seeds respectively, (ii) a putative 
signal peptide either from carrot extensin gene or 
sunflower helianthinin gene to target newly 

20 synthesized A 6 -desaturase into the ER, (iii) an ER 
lumen retention signal sequence (KDEL) at the COOH- 
terminal of the a 6 -desaturase ORF, and (iv) an 
optimized transit peptide to target desaturase into 
the chloroplast. The 35S promoter is a derivative of 

25 pRTL2 described by Restrepo et al. (19 90) . The 

optimized transit peptide sequence is described by Van 
de Broeck et al . (1985) . The carrot extensin signal 
peptide is described by Chen et al (1985) EMBO J. 9, 
2145. 

20 Transgenic tobacco plants were produced 

containing a chimeric cyanobacterial desaturase gene, 



The cyanobacterial a 6 -desaturase gene was 
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1 comprised of the Synechocvstis a 6 desaturase gene fused 
to an endoplasmic reticulum retention sequence (KDEL) 
and extensin signal peptide driven by the CaMV 35S 
promoter. PCR amplifications of transgenic tobacco 
5 genomic DNA indicate that the a 6 desaturase gene was 
incorporated into the tobacco genome. Fatty acid 
methyl esters of leaves of these transgenic tobacco 
plants were extracted and analyzed by Gas Liquid 
Chromatography (GLC) . These transgenic tobacco 

10 accumulated significant amounts of GLA (Figure 4) . 

Figure 4 shows fatty acid methyl esters as determined 
by GLC. Peaks were identified by comparing the 
elution times with known standards of fatty acid 
methyl ester. Accordingly, cyanobacterial genes 

15 involved in fatty acid metabolism can be used to 
generate transgenic plants with altered fatty acid 
compositions . 



20 



25 



30 



35 



SUBSmUtE SHEET «U£ 



WO 96/21022 




PCMB95/01167 



1 EXAMPLE 7 

Construction of Borage cDNA library 

Membrane bound polysomes were isolated from 
5 borage seeds 12 days post pollination (12 DPP) using 
the protocol established for peas by Larkins and 
Davies (1975 Plant Phvs . 55:749-756). RNA was 
extracted from the polysomes as described by Mechler 
(1987 Methods in Enzymology 152 : 241-248 , Academic 
10 Press) . 

Poly-A+ RNA was isolated from the membrane 
bound polysomal RNA by use of Oligotex-dT beads 
(Qiagen) . Corresponding cDNA was made using 
Stratagene's ZAP cDNA synthesis kit. The cDNA library 

15 was constructed in the lambda ZAP II vector 

(Stratagene) using the lambda ZAP II vector kit. The 
primary library was packaged in Gigapack II Gold 
packaging extract (Stratagene) . The library was used 
to generate expressed sequence tags (ESTs) , and 

20 sequences corresponding to the tags were used to scan 
the GenBank database. 
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! EXAMPLE 8 

Hybridization Protocol 

Hybridization probes for screening the 
5 borage cDNA library were generated by using random 
primed DNA synthesis as described by Ausubel et al 
(1994 Current Protocols in Molecular Biology , Wiley 
Interscience, N.Y.) and corresponded to previously 
identified abundantly expressed seed storage protein 

10 cDNAs . Unincorporated nucleotides were removed by use 
of a G-50 spin column (Boehringer Manheim) . Probe was 
denatured for hybridization by boiling in a water bath 
for 5 minutes, then quickly cooled on ice. Filters 
for hybridization were prehybridized at 60°C for 2-4 

15 hours in prehybridization solution { 6XSSC [Maniatis et 
al 1984 Molecular Cloning A Laboratory Manual, Cold 
Spring Harbor Laboratory], IX Denharts Solution, 0.05% 
sodium pyrophosphate, 100 /ig/ml denatured salmon sperm 
DNA) . Denatured probe was added to the hybridization 

20 solution (6X SSC, IX Denharts solution, 0.05% sodium 
pyrophosphate, 100 /ig/ml denatured salmon sperm DNA) 
and incubated at 60°C with agitation overnight. 
Filters were washed in 4x, 2x, and lx SET washes for 
15 minutes each at 60°C. A 20X SET stock solution is 

25 3M NaCl,,0.4 M Tris base, 20 mM Na 2 EDTA-2H 2 0 . The 4X 
SET wash was 4X SET, 12 . 5 mM PO, , pH 6 . 8 and 0.2% SDS . 
The 2X SET wash was 2X SET, 12 . 5 mM P0 4 , pH 6 . 8 and 
0.2% SDS. The IX SET wash was IX SET, 12.5 mM P0 4 , pH 
6.8 and 0.2% SDS. Filters were allowed to air dry and 

20 were then exposed to X-ray film for 24 hours with 
intensifying screens at -80°C. 
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EXAMPLE 9 

Random sequencing of cDNAs from a borage seed 
(12 DPP) membrane -bound polysomal library 

The borage cDNA library was plated at low 
density (500 pfu on 150 mm petri dishes) . Highly 
prevalent seed storage protein cDNAs were "subtracted" 
by screening with the previously identified 
corresponding cDNAs . Non-hybridizing plaques were 
excised using Stratagene's excision protocol and 
reagents. Resulting bacterial colonies were used to 
inoculate liquid cultures and were either sequenced 
manually or by an ABI automated sequencer. Each cDNA 
was sequenced once and a sequence tag generated from 
200-300 base pairs. All sequencing was performed by 
cycle sequencing (Epicentre) . Over 300 ESTs were 
generated. Each sequence tag was compared to GenBank 
database by BLASTX computer program and a number of 
lipid metabolism genes, including the A6-desaturase 
were identified. 

Database searches with a cDNA clone 
designated mbp-65 using BLASTX with the GenBank 
database resulted in a significant match to the 
Svnechocystis A6-desaturase . It was determined 
however, that this clone was not a full length cDNA. 
A full length cDNA was isolated using mbp-65 to screen 
the borage membrane -bound polysomal library. The 
sequence of the isolated cDNA was determined (Fig. 5A, 
SEQ ID NO; 4) and the protein sequence of the open 
reading frame (Fig. 5B, SEQ ID NO: 5) was compared to 
other known desaturases using Geneworks 
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1 (intelligGenetics) protein alignment program (Fig. 2). 
This alignment indicated that the cDNA was the borage 
A6 -desaturase gene. 

Although similar to other known plant 
5 desaturases, the borage delta 6-desaturase is distinct 
as indicated in the dendrogram shown in Fig. 6. 
Furthermore, comparison of the amino acid sequences 
characteristic of desaturases, particularly those 
proposed to be involved in metal binding (metal box 1 

10 ^ nd metal box 2) , illustrates the differences between 
the borage delta 6-desaturase and other plant 
desaturases (Table 3) . 

The borage delta 6-desaturase is 
distinguished from the cyanobacterial form not only in 

15 over all sequence (Fig. 6) but also in the lipid box, 
metal box 1 and metal box 2 amino acid motifs (Table 
3) . As Table 3 indicates, all three motifs are novel 
in sequence. Only the borage delta 6-desaturase metal 
box 2 shown some relationship to the Synechoc vst is 

20 delta-6 desaturase metal box 2. 

In addition, the borage delta 6-desaturase 
is also distinct from another borage desaturase gene, 
the delta-12 desaturase. Pl-81 is a full length cDNA 
that was identified by EST analysis and shows high 

25 similarity to the Arabidopsis delta-12 desaturase (Fad 
2) A comparison of the lipid box, metal box 1 and 
metal box 2 amino acid motifs (Table 3) in borage 
delta 6 and delta-12 desaturases indicates that little 
homology exists in these regions. The placement of 

20 the two sequences in the dendrogram in Fig. 6 

indicates how distantly related these two genes are. 
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EXAMPLE 10 

Construction of 222-lA'NOS for transient 
and expression 

The vector pBI221 (Jefferson et al . 1987 
EMBO J. 6:3901-3907) was prepared for ligation by 
digestion with BamHI and EcoICR I (Promega) which 
excises the GUS coding region leaving the 3 5S promoter 
and NOS terminator intact. The borage A 6-desaturase 
cDNA was excised from the Bluescript plasmid 
(Stratagene) by digestion with BamHI and Xhol . The 
Xhol end was made blunt by use of the Klenow fragment. 
This fragment was then cloned into the BamHI /EcoICR I 
sites of pBI221, yielding 221. ^NOS (Fig. 7). In 
221. A 6 . NOS, the remaining portion (backbone) of the 
restriction map depicted in Fig. 7 is pBI221. 
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j EXAMPLE 11 

Construction of 121. A 6 . NOS for stable transformation 

The vector pBI121 (Jefferson et al . 1987 
5 EMBO J. 6:3901-3907) was prepared for ligation by 
digestion with BamHI and EcoICR I (Promega) which 
excises the GUS coding region leaving the 35S promoter 
and NOS terminator intact. The borage A 6-desaturase 
cDNA was excised from the Bluescript plasmid 

20 (Stratagene) by digestion with BamHI and Xhol . The 

Xhol end was made blunt by use of the Klenow fragment . 
This fragment was then cloned into the BamHI /EcoICR I 
sites of pBI121, yielding 121.1A 6 NOS (Fig. 7). In 
121 . A € .NOS , the remaining portion (backbone) of the 

25 restriction map depicted in Fig. 7 is pBI121. 
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1 EXAMPLE 12 

Transient Expression 

All work involving protoplasts was performed 
5 in a sterile hood. One ml of packed carrot suspension 
cells were digested in 30 mis plasmolyzing solution 
(25 g/1 KC1, 3.5 g/1 CaCl 2 -H 2 0, lOmM MES, pH 5.6 and 
0.2 M mannitol) with 1% cellulase, 0.1% pectolyase, 
and 0.1% dreisalase overnight, in the dark, at room 

10 temperature. Released protoplasts were filtered 

through a 150 /im mesh and pelleted by centrif ugation 
(lOOx g, 5 min.) then washed twice in plasmolyzing 
solution. Protoplasts were counted using a double 
chambered hemocytometer . DNA was transfected into the 

25 protoplasts by PEG treatment as described by Nunberg 
and Thomas (1993 Methods in Plant Molecu lar Biology 
and Biotechnology , B.R. Glick and J.E. Thompson, eds. 
pp. 241-248) using 10* protoplasts and 50-70 ug of 
plasmid DNA (221 . A6 .NOS) . Protoplasts were cultured 

2o in 5 mls of MS media supplemented with 0.2M mannitol 
and 3 /im 2,4-D for 4 8 hours in the dark with shaking. 
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2 EXAMPLE 13 

Stable transformation of tobacco 

121.A 6 .NOS plasmid construction was used to 
5 transform tobacco (Nicotiana tabacum cv. xanthi) via 
Agrobacterium according to standard procedures (Horsh 
et al., 1985 Science 227 : 1229-1231; Bogue et al . , 
1990 Mol. Gen. Genet. 221 :49-57) . except that initial 
transformants were selected on 100 ug/ml kanamycin. 

10 
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^ EXAMPLE 14 

Preparation and analysis of 
fatty acid methyl esters (PAMEs) 

Tissue from transfected protoplasts and 
5 transformed tobacco plants was frozen in liquid 
nitrogen and lyophilized overnight. FAMEs were 
prepared as described by Dahmer et al (1989 J. Aroer. 
Oil Chem. Soc. 66:543-548). In some cases, the 
solvent was evaporated again, and the FAMEs were 
10 resuspended in ethyl acetate and extracted once with 
deionized water to remove any water soluble 
contaminants. The FAMEs were analyzed by gas 
chromatography (GO on a JScW Scientific DB-wax column 
(30 m length, 0.25 mm ID, 0.25 urn film). 
15 An example of a transient assay is shown in 

Fig. 8 which represents three independent 
transfections pooled together. The addition of the 
borage A6 -desaturase cDNA corresponds with the 
appearance of gamma linolenic acid (GLA) which is one 
20 of the possible products of L6 -desaturase . 

Figures 9 and 10 depict GC profiles of the 
FAMES derived from leaf and seed tissue, respectively, 
of control and transformed tobacco plants. Figure 9A 
provides the profile of leaf tissue of wild- type 
25 tobacco (xanthi) ; Figure 9B provides the profile of 

leaf tissue from a tobacco plant transformed with the 
borage A- 6 desaturase under the transcriptional 
control of the 35S CaMV promoter (pBI 121A 6 NOS) . 
Peaks correspond to 18:2, 18:3? (GLA), 18:3a and 18:4 
30 (octadecanonic acid) . Figure 10A shows the GC profile 
of seeds of a wild- type tobacco; Figure 10B shows the 
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2 profile of seed tissue of a tobacco plant transformed 
with pBl 121A s NOS. Peaks correspond to 18:2, 
18:3y(GLA) and 18:3or. 

The relative distribution of the C 18 fatty 

5 acids in control and transgenic tobacco seeds is shown 
in Table 4. 

TABLE 4 



10 



Fatty Acid 


Xanthi 


pBI121A*NOS 


18 : 0 


4.0% 


2 . 5% 


18 : 1 


13% 


13% 


18 :2 


82% 


82% 


18 :3y (GLA) 




2.7% 


18 :3a 


0.82% 


1.4% 



The foregoing results demonstrate that GLA 
is incorporated into the triacylglycerides of 
transgenic tobacco leaves and seeds containing the 
borage A6 -desaturase . 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Rhone- Poulenc Agrochimie 

(ii) TITLE OF INVENTION: PRODUCTION OF GAMMA LINOLENIC ACID BY A 

DELTA 6-DESATURASE 

(iii) NUMBER OF SEQUENCES: 25 
(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Scully, Scott, Murphy & Presser 

(B) STREET: 400 Garden City Plaza 

(C) CITY: Garden City 

(D) STATE: New York 

(E) COUNTRY: United States 

(F) ZIP: 11530 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS -DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 
<vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 30-DEC-1994 

(C) CLASSIFICATION: 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Presser, Leopold 

(B) REGISTRATION NUMBER: 19,827 

<C) REFERENCE/DOCKET NUMBER: 83 83ZYXW 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (516) 742-4343 

(B) TELEFAX: (516) 742-4366 

(C) TELEX: 230 901 SANS UR 

(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3588 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 
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(ix) FEATURE : 

(A) NAME /KEY : CDS 

(B) LOCATION: 2002.. 3081 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l; 



GCTAGCCACC 


AGTGACGATG 


CCTTGAATTT 


GGCCATTCTG 


ACCCAGGCCC 


GTATTCTGAA 


60 


TCCCCGCATT 


CGCATTGTTA 


ATCGTTTGTT 


CAACCATGCC 


CTGGGTAAAC 


GTTTAGACAC 


120 


CACCTTGCCA 


GACCACGTTA 


GTTTGAGTGT 


TTCCGCCCTG 


GCGGCCCCGA 




180 


TGCGGCTTTG 


GGCAATCAGG 


CGATCGGGCA ATTGCGTTTG 


TTTGACCAGA 


CTTGGCCCAT 


240 


TCAGGAAATT 


GTCATTCACC 


AAGACCATCC 


CTGGCTCAAT 


TTACCCCTGG 


CGGATTTATG 


300 


GGATGATCCG 


AGCCGAATGT 


TGATCTATTA 


CCTACCGGCC 


CACAGTGAAA 


CGGATTTAGT 


360 


AGGCGCAGTG 


GTGAATAATT 


TAACGTTGCA 


ATCTGGGGAC 


CATTTAATAG 


TGGGACAAAA 


420 


ACCCCAACCC 


AAGACCAAAC 


GGCGATCGCC 


TTGGCGCAAA 


TTTTCCAAAC 


TGATTACCAA 


480 


CCTGCGGGAG 


TATCAGCGGT 


ATGTCCAACA 


GGTGATATGG 


GTGGTGTTGT 


TTTTATTGTT 


540 


GATGATTTTT 


CTGGCCACCT 


TCATCTACGT 


TTCCATTGAT 


CAACATATTG 


CCCCAGTGGA 


600 


CGCGTTGTAT 


TTTTCCGTGG 


GCATGATTAC 


CGGGGCCGGT 


GGCAAGGAAG 


AGGTGGCCGA 


660 


AAAGTCCCCC 


GATATCATCA 


AAGTATTCAC 


AGTGGTGATG 


ATGATCGCCG 


GGGCGGGGGT 


720 


GATTGGTATT 


TGTTATGCCC 


TACTGAATGA 


TTTCATCCTT 


GGCAGTCGCT 


TTAGTCAGTT 


780 


TTTGGATGCG 


GCCAAGTTAC 


CCGATCGCCA 


TCACATCATC 


ATTTGTGGGC 


TGGGGGGAGT 


840 


GAGCATGGCC 


ATTATTGAAG 


AGTTAATTCA 


CCAGGGCCAT 


GAAATTGTGG 


TAATCGAAAA 


900 


GGATACAGAT 


AATCGTTTCT 


TGCATACGGC 


CCGCTCCCTG 


GGGGTGCCCG 


TAATTGTGGA 


960 


GGATGCCCGC 


CTAGAAAGAA 


CGTTGGCCTG 


CGCCAATATC 


AACCGAGCCG 


AAGCCATTGT 


1020 


GGTGGCCACC 


AGCGACGACA 


CCGTTAACTT 


GGAAATTGGC 


CTAACTGCCA 


AGGCGATCGC 


1080 


CCCTAGCCTG 


CCAGTGGTGT 


TGCGTTGCCA 


GGATGCCCAG 


TTTAGCCTGT 


CCCTGCAGGA 


1140 


AGTATTTGAA 


TTTGAAACGG 


TGCTTTGTCC 


GGCGGAATTG 


GCCACCTATT 


CCTTTGCGGC 


1200 


GGCGGCCCTG 


GGGGGCAAAA 


TTTTGGGCAA 


CGGCATGACC 


GATGATTTGC 


TGTGGGTAGC 


1260 


CCTAGCCACC 


TTAATCACTC 


CTAACCATCC 


CTTTGCCGAC 


CAATTGGTTA 


AAATTGCAGC 


1320 


CCAAAAGTCT 


GATTTCGTTC 


CCCTCTATCT 


AGAACGGGGT 


GGCAAAACCA 


TCCATAGCTG 


1380 


GGAATTATTG 


GGTACCCATC 


TCGACTCTGG 


AGACGTGTTG 


TATTTAACCA 


TGCCCGCCAC 


1440 


TGCCCTAGAG 


CAACTTTGGC 


GATCGCCCCG 


TGCCACTGCT 


GATCCTCTGG 


actctttttt 


1500 
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GGTTTAGCAT GGGGGGATGG AACTCTTGAC TCGGCCCAAT GGTGATCAAG AAAGAACGCT 

TTGTCTATGT TTAGTATTTT TAAGTTAACC AACAGCAGAG GATAACTTCC AAAAGAAATT 

AAGCTCAAAA AGTAGCAAAA TAAGTTTAAT TCATAACTGA GTTTTACTGC TAAACAGCGG 

TGCAAAAAAG TCAGATAAAA TAAAAGCTTC ACTTCGGTTT TATATTGTGA CCATGGTTCC 

CAGGCATCTG CTCTAGGGAG TTTTTCCGCT GCCTTTAGAG AGTATTTTCT CCAAGTCGGC 

TAACTCCCCC ATTTTTAGGC AAAATCATAT ACAGACTATC CCAATATTGC CAGAGCTTTG 

ATGACTCACT GTAGAAGGCA GACTAAAATT CTAGCAATGG ACTCCCAGTT GGAATAAATT 

TTTAGTCTCC CCCGGCGCTG GAGTTTTTTT GTAGTTAATG GCGGTATAAT GTGAAAGTTT 

TTT AT CT ATT TAAATTTATA A ATG CTA ACA GCG GAA AGA ATT AAA TTT ACC 

Met Leu Thr Ala Glu Arg lie Lys Phe Thr 
15 10 

CAG AAA CGG GGG TTT CGT CGG GTA CTA AAC CAA CGG GTG GAT GCC TAC 
Gin Lys Arg Gly Phe Arg Arg Val Leu Asn Gin Arg Val Asp Ala Tyr 

15 20 25 

TTT GCC GAG CAT GGC CTG ACC CAA AGG GAT AAT CCC TCC ATG TAT CTG 
Phe Ala Glu His Gly Leu Thr Gin Arg Asp Asn Pro Ser Met Tyr Leu 
30 35 40 

AAA ACC CTG ATT ATT GTG CTC TGG TTG TTT TCC GCT TGG GCC TTT GTG 
Lvs Thr Leu He He Val Leu Trp Leu Phe Ser Ala Trp Ala Phe Val 
45 SO 55 

CTT TTT GCT CCA GTT ATT TTT CCG GTG CGC CTA CTG GGT TGT ATG GTT 
Leu Phe Ala Pro Val He Phe Pro Val Arg Leu Leu Gly Cys Met Val 
60 65 70 

TTG GCG ATC GCC TTG GCG GCC TTT TCC TTC AAT GTC GGC CAC GAT GCC " 
Leu Ala He Ala Leu Ala Ala Phe Ser Phe Asn Val Gly His Asp Ala 
75 80 85 90 

AAC CAC AAT GCC TAT TCC TCC AAT CCC CAC ATC AAC CGG GTT CTG GGC 
Asn His Asn Ala Tyr Ser Ser Asn Pro His He Asn Arg Val Leu Gly 
95 100 105 

ATG ACC TAC GAT TTT GTC GGG TTA TCT AGT TTT CTT TGG CGC TAT CGC 
Met Thr Tyr Asp Phe Val Gly Leu Ser Ser Phe Leu Trp Arg Tyr Arg 
110 I 15 120 

CAC AAC TAT TTG CAC CAC ACC TAC ACC AAT ATT CTT GGC CAT GAC GTG 
His Asn Tyr Leu His His Thr Tyr Thr Asn He Leu Gly His Asp Val 
125 130 135 
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GTT GOT ATT TAT CGT TTC CAG CAA TTT TAT ATT TGG GGT TTA TAT CTT 2511 
Val Gly lie Tyr Arg Phe Gin Gin Phe Tyr lie Trp Gly Leu Tyr Leu 
155 160 165 170 

TTC ATT CCC TTT TAT TGG TTT CTC TAC GAT GTC TAC CTA GTG CTT AAT 2559 
Phe lie Pro Phe Tyr Trp Phe Leu Tyr Asp Val Tyr Leu Val Leu Asn 
175 180 185 

AAA GGC AAA TAT CAC GAC CAT AAA ATT CCT CCT TTC CAG CCC CTA GAA 2607 
Lys Gly Lys Tyr His Asp His Lys lie Pro Pro Phe Gin Pro Leu Glu 
190 195 200 

TTA GCT AGT TTG CTA GGG ATT AAG CTA TTA TGG CTC GGC TAC GTT TTC 2655 
Leu Ala Ser Leu Leu Gly lie Lys Leu Leu Trp Leu Gly Tyr Val Phe 
205 210 215 

GGC TTA CCT CTG GCT CTG GGC TTT TCC ATT CCT GAA GTA TTA ATT GGT 2703 
Gly Leu Pro Leu Ala Leu Gly Phe Ser lie Pro Glu Val Leu lie Gly 
220 225 230 

GCT TCG GTA ACC TAT ATG ACC TAT GGC ATC GTG GTT TGC ACC ATC TTT 2751 
Ala Ser Val Thr Tyr Met Thr Tyr Gly lie Val Val Cys Thr lie Phe 
235 240 245 250 

ATG CTG GCC CAT GTG TTG GAA TCA ACT GAA TTT CTC ACC CCC GAT GGT 2799 
Met Leu Ala His Val Leu Glu Ser Thr Glu Phe Leu Thr Pro Asp Gly 
255 260 265 

GAA TCC GGT GCC ATT GAT GAC GAG TGG GCT ATT TGC CAA ATT CGT ACC 2847 
Glu Ser Gly Ala lie Asp Asp Glu Trp Ala lie Cys Gin lie Arg Thr 
270 275 280 

ACG GCC AAT TTT GCC ACC AAT AAT CCC TTT TGG AAC TGG TTT TGT GGC 2895 
Thr Ala Asn Phe Ala Thr Asn Asn Pro Phe Trp Asn Trp Phe Cys Gly 
285 290 295 

GGT TTA AAT CAC CAA GTT ACC CAC CAT CTT TTC CCC AAT ATT TGT CAT 2943 
Gly Leu Asn His Gin Val Thr His His Leu Phe Pro Asn lie Cys His 
300 305 310 

ATT CAC TAT CCC CAA TTG GAA AAT ATT ATT AAG GAT GTT TGC CAA GAG 2991 
lie His Tyr Pro Gin Leu Glu Asn lie He Lys Asp Val Cys Gin Glu 
315 320 325 330 

TTT GGT GTG GAA TAT AAA GTT TAT CCC ACC TTC AAA GCG GCG ATC GCC 3039 
Phe Gly Val Glu Tyr Lys Val Tyr Pro Thr Phe Lys Ala Ala He Ala 
335 340 345 

TCT AAC TAT CGC TGG CTA GAG GCC ATG GGC AAA GCA TCG TGACATTGCC 3088 
Ser Asn Tyr Arg Trp Leu Glu Ala Met Gly Lys Ala Ser 

350 355 360 

TTGGGATTGA AGCAAAATGG CAAAATCCCT CGTAAATCTA TGATCGAAGC CTTTCTGTTG 3148 



CCCGCCGACC AAATCCCCGA TGCTGACCAA AGGTTGATGT TGGCATTGCT CCAAACCCAC 3208 
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Leu Trp Leu Phe Ser Ala Trp Ala Phe Val Leu Phe Ala Pro Val He 
50 55 60 

Phe Pro Val Arg Leu Leu Gly Cys Met Val Leu Ala He Ala Leu Ala 
65 7 0 75 80 

Ala Phe Ser Phe Asn Val Gly His Asp Ala Asn His Asn Ala Tyr Ser 
85 90 95 

Ser Asn Pro His He Asn Arg Val Leu Gly Met Thr Tyr Asp Phe Val 
100 105 no 

Gly Leu Ser Ser Phe Leu Trp Arg Tyr Arg His Asn Tyr Leu His His 
115 120 125 

Thr Tyr Thr Asn He Leu Gly His Asp Val Glu He His Gly Asp Glv 
130 135 140 

Ala Val Arg Met Ser Pro Glu Gin Glu His Val Gly He Tyr Arg Phe 
145 150 155 i 6 o 



3588 



TTTGAGGGGG TTCATTGGCC GCAGTTTCAA GCTGACCTAG GAGGCAAAGA TTGGGTGATT 3268 

TTGCTCAAAT CCGCTGGGAT ATTGAAAGGC TTCACCACCT TTGGTTTCTA CCCTGCTCAA 3328 

TGGGAAGGAC AAACCGTCAG AATTGTTTAT TCTGGTGACA CCATCACCGA CCCATCCATG 3388 

TGGTCTAACC CAGCCCTGGC CAAGGCTTGG ACCAAGGCCA TGCAAATTCT CCACGAGGCT 3448 

AGGCCAGAAA AATTATATTG GCTCCTGATT TCTTCCGGCT ATCGCACCTA CCGATTTTTG 3508 

AGCATTTTTG CCAAGGAATT CTATCCCCAC TATCTCCATC CCACTCCCCC GCCTGTACAA 3568 
AATTTTATCC ATCAGCTAGC 
(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 59 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Leu Thr Ala Glu Arg He Lys Phe Thr Gin Lys Arg Gly Phe Arg 
1 5 io " 15 

Arg Val Leu Asn Gin Arg Val Asp Ala Tyr Phe Ala Glu His Gly Leu 
20 25 30 

Thr Gin Arg Asp Asn Pro Ser Met Tyr Leu Lys Thr Leu He He Val 
35 40 
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Gin Gin Phe Tyr He Trp Gly Leu Tyr Leu Phe He Pro Phe Tyr Trp 
165 170 175 

Phe Leu Tyr Asp Val Tyr Leu Val Leu Asn Lys Gly Lys Tyr His Asp 
180 185 190 

His Lys He Pro Pro Phe Gin Pro Leu Glu Leu Ala Ser Leu Leu Gly 
195 200 205 

He Lys Leu Leu Trp Leu Gly Tyr Val Phe Gly Leu Pro Leu Ala Leu 
210 215 220 

Gly Phe Ser He Pro Glu Val Leu lie Gly Ala Ser Val Thr Tyr Met 
225 230 235 240 

Thr Tyr Gly He Val Val Cys Thr He Phe Met Leu Ala His Val Leu 
245 250 255 

Glu Ser Thr Glu Phe Leu Thr Pro Asp Gly Glu Ser Gly Ala He Asp 
260 265 270 

Asp Glu Trp Ala He Cys Gin He Arg Thr Thr Ala Asn Phe Ala Thr 
275 280 285 

Asn Asn Pro Phe Trp Asn Trp Phe Cys Gly Gly Leu Asn His Gin Val 
290 295 300 

Thr His His Leu Phe Pro Asn He Cys His He His Tyr Pro Gin Leu 
305 310 315 320 

Glu Asn He He Lys Asp Val Cys Gin Glu Phe Gly Val Glu Tyr Lys 
325 330 335 

Val Tyr Pro Thr Phe Lys Ala Ala He Ala Ser Asn Tyr Arg Trp Leu 
340 345 350 

Glu Ala Met Gly Lys Ala Ser 
355 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1884 base pairs 

(B) TYPE : nucleic acid 

(C) STRAND EDNESS : both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
AGCTTCACTT CGGTTTTATA TTGTGACCAT GGTTCCCAGG CATCTGCTCT AGGGA GTTTT 60 
TCCGCTGCCT TTAGAGAGTA TTTTCTCCAA GTCGGCTAAC TCCCCCATTT TTAGGCAAAA 120 
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TCGTTTCCAG 
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ATATCTT'rX'C 
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GATTAAGCTA 


TTATGGCTCG 


GCTACGTTTT 
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TTTCCCCAAT 


ATTTGTCATA 
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1260 


AATATTATTA 


AGGATUTITG 


CCAAGAGTTT 


GGTGTGGAAT 


ATAAAGTTTA 


TCCCACCTTC 


1320 


AAAGCGGCGA 


TCGCCTCTAA 


CTATCGCTGG 


CTAGAGGCCA 


TGGGCAAAGC 


ATCGTGACAT 


1380 


TGCCTTGGGA 


TTGAAGCAAA 


ATGGCAAAAT 


CCCTCGTAAA 


TCTATGATCG 


AAGCCTTTCT 


1440 


GTTGCCCGCC 


G AC CAAATCC 


CCGATGCTGA 


CCAAAGGTTG 


ATGTTGGCAT 


TGCTCCAAAC 


1500 


CCACTTTGAG 


GGGGTTCATT 


GGCCGCAGTT 


TCAAGCTGAC 


CTAGGAGGCA 


AAGATTGGGT 


1560 


GATTTTGCTC 


AAATCCG ctg 


GGATATTGAA 


AGGCTTCACC 


ACCTTTGGTT 


TCTACCCTGC 


1620 


TCAATGGGAA 


GGACAAACCG 


TCAGAATTGT 


TTATTCTGGT 


GACACCATCA 


CCGACCCATC 


1680 


CATGTGGTCT 


AACCCAGCCC 


TGGCCAAGGC 


TTGGACCAAG 


G CC ATGC AAA 


TTCTCCACGA 


1740 


GGCTAGGCCA 


GAAAAATTAT 


ATTGGCTCCT 


GATTTCTTCC 


GGCTATCGCA 


CCTACCGATT 


1800 
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TTTGAGCATT TTTGCCAAGG AATTCTATCC CCACTATCTC 
ACAAAATTTT ATCCATCAGC TAGC 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1685 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

AATATCTGCC TACCCTCCCA AAGAGAGTAG TCATTTTTCA 

GAAATACATT ACCTCAGATG AACTCAAGAA CCACGATAAA 

GATTCAAGGG AAAGCCTATG ATGTTTCGGA TTGGGTGAAA 

TCCCTTGAAG AGTCTTGCTG GTCAAGAGGT AACTGATGCA 

CTCTACATGG AAGAATCTTG ATAAGTTTTT CACTGGGTAT 

TTCTGAGGTT TCTAAAGATT ATAGGAAGCT TGTGTTTGAG 

TGACAAAAAA GGTCATATTA TGTTTGCAAC TTTGTGCTTT 

GAGTGTTTAT GGGGTTTTGT TTTGTGAGGG TGTTTTGGTA 

GATGGGGTTT CTTTGGATTC AGAGTGGTTG GATTGGACAT 

AGTGTCTGAT TCAAGGCTTA ATAAGTTTAT GGGTATTTTT 

AATAAGTATT GGTTGGTGGA AATGGAACCA TAATGCACAT 

TGAATATGAC CCTGATTTAC AATATATACC ATTCCTTGTT 

TTCACTCACC TCTCATTTCT ATGAGAAAAG GTTGACTTTT 

TGTAAGTTAT CAACATTGGA CATTTTACCC TATTATGTGT 

TGTACAATCT CTCATAATGT TGTTGACCAA GAGAAATGTG 

CTTGGGATGC CTAGTGTTCT CGATTTGGTA CCCGTTGCTT 

GGGTGAAAGA ATTATGTTTG TTATTGCAAG TTTATCAGTG 

GTTCTCCTTG AACCACTTCT CTTCAAGTGT TTATGTTGGA 

GTTTGAGAAA CAAACGGATG GGACACTTGA CATTTCTTGT 

TCATGGTGGA TTGCAATTCC AAATTGAGCA TC ATTTGTTT 



CATCCCACTC CCCCGCCTGT 



1860 
1884 



TCAATGGCTG 
CCCGGAGATC 
GACCATCCAG 
TTTGTTGCAT 
TATCTTAAAG 
TTTTCTAAAA 
ATAGCAATGC 
CATTTGTTTT 
GATGCTGGGC 
GCTGCAAATT 
CACATTGCCT 
GTGTCTTCCA 
GACTCTTTAT 
GCTGCTAGGC 
TCCTATOGAG 
GTTTCTTGTT 
ACTGGAATGC 
AAGCCTAAAG 
CCTCCTTGGA 
CCCAAGATGC 



CTCAAATCAA 
TATGGATCTC 
GTGGCAGCTT 
TCCATCCTGC 
ATTACTCTGT 
TGGGTTTGTA 
TGTTTGCTAT 
CTGGGTGTTT 
ATTATATGGT 
GTCTTTCAGG 
GTAATAGCCT 
AGTTTTTTGG 
CAAG ATTCTT 
TCAATATGTA 
CTCAGGAACT 
TGCCTAATTG 
AACAAGTTCA 
GGAATAATTG 
TGGATTGGTT 
CTAGATGCAA 



60 
120 
160 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
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CCTTAGGAAA ATCTCGCCCT ACGTGATCGA GTTATGCAAG AAACATAATT TGCCTTACAA 
TTATGCATCT TTCTCCAAGG CCAATGAAAT GACACTCAGA ACATTGAGGA ACACAGCATT 
GCAGGCTAGG GATATAACCA AGCCGCTCCC GAAGAATTTG GTATGGGAAG CTCTTCACAC 
TCATGGTTAA AATTACCCTT AGTTCATGTA ATAATTTGAG ATTATGTATC TCCTATGTTT 
GTGTCTTGTC TTGGTTCTAC TTGTTGGAGT CATTGCAACT TGTCTTTTAT GGTTTATTAG 
AT GTTTTTTA ATATATTTTA GAG GTTTT GC TTTCATCTCC ATTATTGATG AATAAGGAGT 
TGCATATTGT CAATTGTTGT GCTCAATATC TGATATTTTG GAATGTACTT TGTACCACTG 
TGTTTTCAGT TGAAGCTCAT GTGTACTTCT ATAGACTTTG TTTAAATGGT TATGTCATGT 
TATTT 

<2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 448 amino acids 

(B) TYPE : amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Met Ala Ala Gin lie Lys Lys Tyr lie Thr Ser Asp Glu Leu Lys Asn 
1 5 10 15 

His Asp Lys Pro Gly Asp Leu Trp He Ser He Gin Gly Lys Ala Tyr 
20 25 30 

Asp Val Ser Asp Trp Val Lys Asp His Pro Gly Gly Ser Phe Pro Leu 
^ 35 ~ 40 45 

Lys Ser Leu Ala Gly Gin Glu Val Thr Asp Ala Phe Val Ala Phe His 
50 55 60 

Pro Ala Ser Thr Trp Lys Asn Leu Asp Lys Phe Phe Thr Gly Tyr Tyr 
65 70 75 80 

Leu Lys Asp Tyr Ser Val Ser Glu Val Ser Lys Asp Tyr Arg Lys Leu 
85 90 95 

Val Phe Glu Phe Ser Lys Met Gly Leu Tyr Asp Lys Lys Gly His He 
100 105 HO 

Met Phe Ala Thr Leu Cys Phe He Ala Met Leu Phe Ala Met Ser Val 
115 120 125 

Tyr Gly Val Leu Phe Cys Glu Gly Val Leu Val His Leu Phe Ser Gly 
130 135 140 



1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1685 
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Cys Leu Met Gly Phe Leu Trp lie Gin Ser Gly Trp lie Gly His Asp 
145 150 155 160 

Ala Gly His Tyr Met Val Val Ser Asp Ser Arg Leu Asn Lys Phe Met 
165 170 175 

Gly lie Phe Ala Ala Asn Cys Leu Ser Gly lie Ser lie Gly Trp Trp 
180 185 190 

Lys Trp Asn His Asn Ala His His lie Ala Cys Asn Ser Leu Glu Tyr 
195 200 205 

Asp Pro Asp Leu Gin Tyr lie Pro Phe Leu Val Val Ser Ser Lys Phe 
210 215 220 

Phe Gly Ser Leu Thr Ser His Phe Tyr Glu Lys Arg Leu Thr Phe Asp 
225 230 235 240 

Ser Leu Ser Arg Phe Phe Val Ser Tyr Gin His Trp Thr Phe Tyr Pro 
245 250 255 

lie Met Cy6 Ala Ala Arg Leu Asn Met Tyr Val Gin Ser Leu lie Met 
260 265 270 

Leu Leu Thr Lys Arg Asn Val Ser Tyr Arg Ala Gin Glu Leu Leu Gly 
275 280 285 

Cys Leu Val Phe Ser lie Trp Tyr Pro Leu Leu Val Ser Cys Leu Pro 
290 295 300 

Asn Trp Gly Glu Arg lie Met Phe Val lie Ala Ser Leu Ser Val Thr 
305 310 315 320 

Gly Met Gin Gin Val Gin Phe Ser Leu Asn His Phe Ser Ser Ser Val 
325 330 335 

Tyr Val Gly Lys Pro Lys Gly Asn Asn Trp Phe Glu Lys Gin Thr Asp 
34 0 34 5 350 

Gly Thr Leu Asp lie Ser Cys Pro Pro Trp Met Asp Trp Phe His Gly 
355 360 365 

Gly Ser Gin Phe Gin lie Glu His His Leu Phe Pro Lys Met Pro Arg 
370 375 380 

Cys Asn Leu Arg Lys lie Ser Pro Tyr Val lie Glu Leu Cys Lys Lys 
385 390 395 400 

His Asn Leu Pro Tyr Asn Tyr Ala Ser Phe Ser Lys Ala Asn Glu Met 
405 410 415 

Thr Leu Arg Thr Leu Arg Asn Thr Ala Leu Gin Ala Arg Asp lie Thr 
420 425 430 

Lys Pro Leu Pro Lys Asn Leu Val Trp Glu Ala Leu His Thr His Gly 
435 440 445 
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(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

Trp lie Gly His Asp Ala Gly His 
1 5 

(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

Asn Val Gly His Asp Ala Asn His 
1 5 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

Val Leu Gly His Asp Cys Gly His 
1 5 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
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Val He Ala His Glu Cys Gly His 
1 5 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Val He Gly His Asp Cys Ala His 
1 5 

(2) INFORMATION FOR SEQ ID NO: 11: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Val Val Gly His Asp Cys Gly His 
1 5 

(2) INFORMATION FOR SEQ ID NO: 12: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

His Asn Ala His His 
1 5 

(2) INFORMATION FOR SEQ ID NO: 13: 
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
Hie Asn Tyr Leu His His 



(2) INFORMATION FOR SEQ ID NO: 14: 
<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

His Arg Thr His His 
1 5 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

His Arg Arg His His 
1 5 



1 



5 
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(2) INFORMATION FOR SEQ ID NO: 16: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE : amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

His Asp Arg His His 
1 5 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

His Asp Gin His His 
1 5 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

His Asp His His His 
1 5 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

His Asn His His His 
1 5 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Phe Gin He Glu His His 
1 5 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

His Gin Val Thr His His 
1 5 

(2)' INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 22 : 

His Val He His His 
1 5 

(2) INFORMATION FOR SEQ ID NO:23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
His Val Ala His His 



(2) INFORMATION FOR SEQ ID NO: 24: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

His He Pro His His 
1 5 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

His Val Pro His His 
1 5 



1 
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1 WHAT IS CLAIMED : 

1. An isolated nucleic acid encoding a borage 
A6 -desaturase . 

5 

2. The isolated nucleic acid of Claim 1 
comprising the nucleotide sequence of SEQ ID NO: 4. 

3. An isolated nucleic acid that codes for the 
amino acid sequence of SEQ ID NO : 5. 



10 



15 



4. A vector comprising the nucleic acid of any 
one Claims 1- 3 . 

5 . An expression vector comprising the 
isolated nucleic acid of any one of Claims 1-3 operably 
linked to a promoter and optionally a termination signal 
capable of effecting expression of the gene product of 
said isolated nucleic acid. 



20 



6. The expression vector of Claim 5 wherein 
said promoter is a A-6 desaturase promoter, an Anabaena 
carboxylase promoter, a helianthinin promoter, a glycinin 
promoter, a napin promoter, the 35S promoter from CaMV, or 
oc a helianthinin tissue-specific promoter. 

7 The expression vector of Claim 5 wherein 
said promoter is constitutive or tissue-specific. 

8 The expression vector of Claim 5 wherein 
3 ° said termination signal is a Synechocystis termination 
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1 signal, a nopal ine synthase termination signal, or a seed 
termination signal. 

9. A cell comprising the vector of any one of 
5 Claims 4-8. 

10. The cell of Claim 9 wherein said cell is an 
animal cell, a bacterial cell, a plant cell or a fungal 
cell. 

10 

11. A transgenic organism comprising the 
isolated nucleic acid of any one of Claims 1-3. 

12. A transgenic organism comprising the vector 
of any one of Claims 4-8. 

13 . The transgenic organism of Claim 11 or 12 
wherein said organism is a bacterium, a fungus, a plant or 
an animal . 



20 



25 



30 



14 . A plant or progeny of said plant which has 

been regenerated from the plant cell of Claim 10. 

15. The plant of Claim 14 wherein said plant is 
a sunflower, soybean, maize, tobacco, peanut, carrot or 
oil seed rape plant. 

16. A method of producing a plant with 
increased gamma linolenic acid (GLA) content which 
comprises : 
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^ (a) transforming a plant cell with the isolated 

nucleic acid of any one of Claims 1-3; and 

(b) regenerating a plant with increased GLA 
content from said plant cell. 

5 

17. A method of producing a plant with 
increased gamma linolenic acid (GLA) content which 
comprises : 

(a) transforming a plant cell with the vector of 
20 any one of Claims 4-8; and 

(b) regenerating a plant with increased GLA 
content from said plant cell. 

18. The method of Claim 16 or 17 wherein said 
25 plant is a sunflower, soybean, maize, tobacco, peanut, 

carrot or oil seed rape plant. 

19. A method of inducing production of gamma 
linolenic acid (GLA) in an organism deficient or lacking 

2o in GLA which comprises transforming said organism with the 
isolated nucleic acid of any one of Claims 1-3. 

20. A method of inducing production of gamma 
linolenic acid (GLA) in an organism deficient or lacking 

2^ in GLA which comprises transforming said organism with the 
vector of any one of Claims 4-8. 

21. A method of inducing production of gamma 
linolenic acid (GLA) in an organism deficient or lacking 

^ 0 in GLA and linoleic acid (LA) which comprises transforming 
said organism with an isolated nucleic acid encoding 
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1 borage a6 -desaturase and an isolated nucleic acid encoding 
a12 -desaturase . 

22. The method of Claim 21 wherein said 

5 isolated nucleic acid encoding a6 -desaturase comprises 
nucleotides 44 to 1390 of SEQ. ID NO: 4. 

23. A method of inducing production of 
octadecatetraeonic acid in an organism deficient or 
lacking in gamma linolenic acid which comprises 
transforming said organism with the isolated nucleic acid 
of any one- of Claims 1-3, 

24 . A method of inducing production of 
25 octadecatetraeonic acid in an organism deficient or 

lacking in gamma linolenic acid which comprises 
transforming said organism with the vector of any one of 
Claims 4-8. 



25. The method of Claim 23 or 24 wherein said 
organism is a bacterium, a fungus, a plant or an animal. 

26. A method of producing a plant with improved 
chilling resistance which comprises: 

-(a) transforming a plant cell with the isolated 
nucleic acid of any one of Claims 1-3; and 

(b) regenerating said plant with improved 
chilling resistance from said transformed plant cell. 

27. A method of producing a plant with improved 
chilling resistance which comprises: 
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(a) transforming a plant cell with the vector of 
any one of Claims 4-8; and 

(b) regenerating said plant with improved 
chilling resistance from said transformed plant cell. 

5 

28. The method of Claim 26 or 27 wherein said 
plant is a sunflower, soybean, maize, tobacco, peanut, 
carrot or oil seed rape plant . 
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INTERNATIONALER VORLAUFIGER 

PRUFUNGSBERICHT Internationales Aktenzeichen PCT/E POO/06223 

I. Grundlage des Berichts 

1 . Hinsichtlich der Bestandteile der internationalen Anmeldung (Ersatzblatter, die dem Anmeldeamt auf eine 
Aufforderung nach Artikel 14 hin vorgelegt wurden, gelten im Rahmen dieses Berichts als "ursprunglich 
eingereicht" und sind ihm nicht beigefugt, weil sie keine Anderungen enthalten (Regeln 70 16 und 70 17)Y 
Beschreibung, Seiten: " 

1 -34 ursprungliche Fassung 

Patentanspriiche, Nr.: 

1-12 ursprungliche Fassung 

Sequenzprotokoll in der Beschreibung, Seiten: 

1-6, in der ursprunglich eingereichten Fassung. 



2. Hinsichtlich der Sprache: Alle vorstehend genannten Bestandteile standen der Behorde in der Sprache in der 
die Internationale Anmeldung eingereicht worden ist, zur Verfugung oder wurden in dieser eingereicht sofern 
unter diesem Punkt nichts anderes angegeben ist. 

Die Bestandteile standen der Behorde in der Sprache: zur Verfugung bzw. wurden in dieser Sprache 
eingereicht; dabei handelt es sich um 

□ die Sprache der Ubersetzung, die fur die Zwecke der internationalen Recherche eingereicht worden ist (nach 
Regel 23.1(b)). 

□ die Veroffentlichungssprache der internationalen Anmeldung (nach Regel 48.3(b)). 

□ die Sprache der Ubersetzung, die fur die Zwecke der internationalen vorlaufigen Prufung eingereicht worden 
ist (nach Regel 55.2 und/oder 55.3). 

3. Hinsichtlich der in der internationalen Anmeldung offenbarten Nucleotid- und/oder Aminosauresequenz ist die 
Internationale vorlaufige Prufung auf der Grundlage des Sequenzprotokolls durchgefuhrt worden, das: 

H in der internationalen Anmeldung in schriftlicher Form enthalten ist. 

H zusammen mit der internationalen Anmeldung in computerlesbarer Form eingereicht worden ist. 

□ bei der Behorde nachtraglich in schriftlicher Form eingereicht worden ist. 

□ bei der Behorde nachtraglich in computerlesbarer Form eingereicht worden ist. 

□ Die Erkiarung, daB das nachtraglich eingereichte schriftliche Sequenzprotokoll nicht uber den 
Offenbarungsgehalt der internationalen Anmeldung im Anmeldezeitpunkt hinausgeht, wurde vorgelegt. 

□ Die Erkiarung, daf3 die in computerlesbarer Form erfassten Informationen dem schriftlichen 
Sequenzprotokoll entsprechen, wurde vorgelegt. 

4. Aufgrund der Anderungen sind folgende Unterlagen fortgefallen: 

□ Beschreibung, Seiten: 
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□ Anspruche, Nr.: 

□ Zeichnungen, Blatt: 



5. □ 



Dieser Bericht ist ohne Berucksichtigung (von einigen) der Anderungen erstellt worden, da diese aus den 
angegebenen Grunden nach Auffassung der Behorde uber den Offenbarungsgehalt in der ursprunglich 
eingereichten Fassung hinausgehen (Regel 70.2(c)). 

(Auf Ersatzblatter, die solche Anderungen enthalten, ist unter Punkt 1 hinzuweisen;sie sind diesem Bericht 
beizufugen). 



6. Etwaige zusatzliche Bemerkungen: 



V. Begriindete Feststellung nach Artikel 35(2) hinsichtlich der Neuheit, der erf inderischen Tatigkeit und der 
gewerblichen Anwendbarkeit; Unterlagen und Erklarungen zur Stutzung dieser Feststellung 

1. Feststellung 

Neuheit (N) Ja: Anspruche 5,6 

Nein: Anspruche 1-4,7-12 

Erfinderische Tatigkeit (ET) Ja: Anspruche 

Nein: Anspruche 1-12 

Gewerbliche Anwendbarkeit (GA) Ja: Anspruche 1-12 

Nein: Anspruche 



2. Unterlagen und Erklarungen 
siehe Beiblatt 



VIII. Bestimmte Bemerkungen zur internationalen Anmeldung 

Zur Klarheit der Patentanspruche, der Beschreibung und der Zeichnungen Oder zu der Frage, ob die Anspruche 
in vollem Umfang durch die Beschreibung gestutzt werden, ist folgendes zu bemerken: 
siehe Beiblatt 
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Ad section V.: 

1 . Auf folgende Dokumente wird in diesem Bescheid Bezug genommen: 

D1 The plant Journal 15(1), 39-48, 1998 
D2 WO-A-98/46764 
D3 WO-A-96/21022 

2. Die vorliegende Internationale Anmeldung betrifft ein Verfahren zur Herstellung 
von ungesattigten Fettsauren. Es werden transgene Organismen hergestellt 
(vorzugsweise Pflanzen, Algen Oder Pilze), die aufgrund der Expression einer 
delta-6-Desaturase aus Moos einen erhohten Gehalt an Fettsauren, Olen oder 
Lipiden mit delta-6-Doppelbindungen aufweisen. Desweiteren betrifft die 
Internationale Anmeldung die fur das obige Verfahren hergestellten transgenen 
Organismen, die durch das Verfahren hergestellten Ole, Lipide oder Fettsauren, 
sowie deren Verwendung in Futtermitteln, Nahrungsmitteln, Kosmetika oder 
Pharmazeutika. 

Im Hinblick auf die im Internationalen Recherchenbericht zitierten Dokumente 
konnen nur die Anspruche 5 und 6 der vorliegenden Internationalen Anmeldung 
als neu betrachtet werden (Artikel 33(2) PCT). 

2.1 D1 offenbart die Isolierung und Klonierung einer cDNA sowie der dazu 

korrespondierenden genomischen DNA-Sequenz aus dem Moos Physcomitrella 
patens. Das von dieser DNA kodierte Protein wurde als eine delta-6-Desaturase 
identifiziert. Durch Expression des Proteins in S.cerevisiae sowie durch Analyse 
der aus diesem transgenen Organismus gewonnenen Fettsauren konnte bestatigt 
werden, daG die klonierte DNA fur eine delta-6-Desaturase kodiert. Sowohl die 
Nukleotid-Sequenz als auch die Aminosaure-Sequenz der in D1 isolierten DNA 
bzw. des korrespondierenden Proteins weisen eine 100 %ige Identitat uber die 
gesamte Lange mit der in der vorliegenden Internationalen Anmeldung 
offenbarten Sequenz SEQ ID NO:1 bzw. SEQ ID NO:2 auf (s.D1,S.44-47, 
'Functional expression of PPDES6 in Saccharomyces cerevisiae', 
'Discussion', 'Expression in S.cerevisiae', 'Lipid analysis' and Fig.1). 



Das obige Dokument ist daher neuheitsschadlich fur den Gegenstand der 
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Anspruche 1-4 und 7-11. 

2.2 D2 beschreibt eine Methode zur Herstellung von mehrfach ungesattigten 
langkettigen Fettsauren in Pflanzen. Expressionskonstrukte enthaltend DNA- 
Sequenzen kodierend fur eine delta-6-, delta-5- oder delta- 12-Desatu rase wurden 
zunachst zur Herstellung dieser transgenen Pflanzen verwendet. Es wurde 
gezeigt, daB eine Expression dieser Desaturasen in den Pflanzen die Herstellung 
von groBen Mengen an mehrfach ungesattigten Fettsauren ermoglicht, und auf 
diese Weise zu einer Veranderung des Fettsaure-Profils dieser Pflanzen fuhrt. 
Diese Manipulation des Fettsaure-Profils erlaubt nunmehr die Herstellung von 
kommerziell nutzbaren Mengen an Pflanzenolen sowie deren Verwendung als 
Pharmazeutika, Nahrungsmittel etc. (s.D2, S.5, Zeile 27 - S.6, Zeile 17, S.8, Zeile 
19 - S.36, Zeile 27, Beispiele 6-8,13,14,16). 

In Hinblick auf D2 ist der Gegenstand der Anspruche 1 1 und 12 daher nicht neu. 

2.3 D3 offenbart die Klonierung einer DNA kodierend fur eine delta-6- Desatu rase aus 
dem Cyanobakterium Synechocystis sowie einer cDNA kodierend fur eine delta-6- 
Desaturase aus Borretsch. Diese DNA-Sequenzen wurden in verschiedenen 
Organismen, wie z.B. in Tabakpflanzen, exprimiert, und es wurde gezeigt, daB in 
den transgenen Organismen mittels dieser Sequenzen ungesattigte Fettsauren, 
wie z.B. GLA, hergestellt wurden (s.D3, S.3, Zeilen 3-23, S.5. Zeile 16 - S.19, 
Zeile 24, Beispiele 6,13,14, Anspruche 11-18). 

Das obige Dokument ist daher neuheitsschadlich fur den Gegenstand des 
Anspruches 1 1 . 

3. Zur Beurteilung eines erfinderischen Schrittes der Anspruche 5 und 6 der 

vorliegenden Internationalen Anmeldung wird ebenfalls D1 als der nachstliegende 
Stand der Technik herangezogen. 

Diese Anspruche betreffen die Verwendung einer transgenen Alge oder Pflanze, 
insbesondere einer Olfruchtpflanze im Verfahren zur Herstellung von 
ungesattigten Fettsauren. 

Diese abhangigen Anspruche scheinen keine zusatzlichen Merkmale zu 
enthalten, welche in Kombination mit den Merkmalen der Anspruche auf die sie 
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sich beziehen, einen erfinderischen Schritt beinhalten. Die Verwendung 
transgener Pflanzen, bzw. Olfruchtpflanzen, in einem Verfahren zur Herstellung 
von ungesattigten Fettsauren ist bereits aus D2 oder D3 bekannt. 
Der Gegenstand der Anspruche 5 und 6 beruht daher nicht auf einer nach Artikel 
33(3) PCT erforderlichen erfinderischen Tatigkeit. 



Ad section VIII.: 



1 . Den Anspruchen 1 ,4 und 7-9 mangelt es an Klarheit aufgrund der Ausdrucke 
"Organismus" und "Tiere". Die Beschreibung der vorliegenden internationalen 
Anmeldung nimmt nur Bezug auf tierische Zellen, nicht jedoch auf Tiere als 
solche. Desweiteren ist es absolut notwendig klarzustellen, daf3 der Mensch nicht 
unter die Begriffe "Organismus" und "Tiere" fallt. 



Formblatt PCT/Beiblatt/409 (Blatt 3) (EPA-April 1997) 



jt r (12) NACH DEM VERTRAG UBER DIE INTERN ATI ON ALE ZUSAMMENARBEIT AUF DEM GEBIET DES 
PATENTWESENS (PCT) VEROFFENTLICHTE INTERNATIONALE ANMELDUNG 



(19) Weltorganisation fur geistiges Eigentum 
Internationales.Biiro 

(43) Internationales Veroffentlichungsdatum (10) Internationale Veroffentlichungsnummer 

11. Januar 2001 (11.01.2001) PCT WO 01/02591 Al 




(51) Internationale Patentklassifikation 7 : C12N 15/82, 

9/02, 15/53, C12P 7/64, C11C 3/00, A01H 5/00, 13/00, 
15/00, A23L 1/30, A23K 1/16, A61K 35/78 

(21) Internationales Aktenzeichen: PCT/EP00/06223 

(22) Internationales Anmeldedatum: 

4. Juli 2000 (04.07.2000) 

(25) Einreichungssprache: Deutsch 



(74) Gemeinsamer Vertreler: BASF AKTIENGE- 
SELLSCHAFT; 67056 Ludwigshafen (DE). 

(81) Bestimmungsstaaten (national): AE, AG, AL, AM, AT, 
AU, AZ, BA, BB, BG, BR, BY, BZ, CA, CH, CN, CR, CU, 
CZ, DE, DK, DM, DZ, EE, ES, H, GB, GD, GE, GH, GM, 
HR, HU, ID, IL, IN, IS, JP, KE, KG, KP, KR, KZ, LC, LK, 
LR, LS, LT, LU, LV, MA, MD, MG, MK, MN, MW, MX, 
MZ, NO, NZ, PL, PT, RO, RU, SD, SE, SG, SI, SK, SL, 
TJ, TM, TR, TT, TZ, UA, UG, US, UZ, VN, YU, ZA, ZW. 



(26) Verofientlichungssprache: 



Deutsch 



(30) Angaben zur Prioritat: 

09/347,531 6. Juli 1999 (06.07.1999) US 

100 30 976.3 30. Juni 2000 (30.06.2000) DE 

(71) A nm elder (fur alle Bestimmungsstaaten mit Ausnahme 
von US): BASF AKTIENGESELLSCHAFT [DE/DE]; 
D-67056 Ludwigshafen (DE). 

(72) Erfmder; und 

(75) Erfinder/Anmelder (nur fur US): HEINZ, Ernst 
[DE/DE]; Puettkampsweg 13, D-22609 Hamburg (DE). 
GIRKE, Thomas [DE/US]; 16945 Vinaniz Place, San 
Diego, CA 92128 (US). SCHEFFLER, Jodi [US/US]; 51 
Country Road 228, Oxford, MS 38655 (US). DA COSTA 
E SILVA, Oswaldo [BR/US]; 203 Littleford Lane, Apex, 
NC 27502 (US). 



(84) Bestimmungsstaaten (regional): ARIPO-Patent (GH, 
GM, KE, LS, MW, MZ, SD, SL, SZ, TZ, UG, ZW), eura- 
sisches Patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), 
europaisches Patent (AT, BE, CH, CY, DE, DK, ES, FI, 
FR, GB, GR, IE, IT, LU, MC, NL, PT, SE), OAPI-Patent 
(BF, BJ, CF, CG, CI, CM, GA, GN, GW, ML, MR, NE, 
SN, TD, TG). 

Veroffentlicht: 

— Mit internationalem Recherchenbericht. 

— Vor Ablauf der fur Anderungen der Ansprucne geltenden 
Frist; Verbffentlichung wird mederholt. falls Anderungen 
eintreffen. 

Zur Erkldrung der Zweibuchstaben-Codes, und der anderen 
Abkurzungen wird auf die Erklarungen ("Guidance Notes on 
Codes and Abbreviations*) am Anfangjeder regular en Ausgabe 
der PCT-Gazette verwiesen. 



(54) Title: PLANTS EXPRESSING A 6-DES ATURASE GENES AND OILS FROM THESE PLANTS CONTAINING PUFAS 
AND METHOD FOR PRODUCING UNSATURATED FATTY ACIDS 



(54) Bezeichnung: A 6-DES ATURASEGENE EXPRIMIERENDE PFLANZEN UND PUFAS ENTHALTENDE OLE AUS DIE- 
SEN PFLANZEN UND EIN VERFAHREN ZUR HERSTELLUNG UNGESATTIGTER FETTSAUREN 



^ (57) Abstract: The invention relates to an improved method for producing unsaturated fatty acids and to a method for producing 
^ triglycerides with an increased unsaturated fatty acid content. The invention also relates to the production of a transgenic organism, 
preferably a transgenic plant or a transgenic micro-organism, containing increased quantities of unsaturated fatty acids, oils or lipids 
^ with A 6-double bonds as a result of the expression of a A-6-desaturase, from moss. The invention also relates to transgenic organ- 
©\ isms containing a A6-desaturase gene, and to the use of the unsaturated fatty acids or triglycerides with an increased unsaturated 
id fatty acid content produced in the method. 

^ (57) Zusammenfassung: Die vorliegende Erfindung betriftt ein verbessenes Verfahren zur Herstellung von ungesattigten Fettsau- 
_ ren sowie ein Verfahren zur Herstellung von Triglyceriden mit einem erhohten Gehalt an ungesattigten Fettsauren. Die Erfindung 
betriftt die Herstellung eines rransgenen Organismuses bevorzugt einer transgenen Pflanze oder eines transgenen Mikroorganismus 
Q mit erhohtem Gehalt an Fettsauren, Olen oder Lipiden mit A6-Doppelbindungen aufgrund der Expression einer A-6-Desaxurase aus 
£^ Moos. Ausserdem betrifft die Erfindung transgene Organismen, die ein A 6-Desaturasegen enthalten, sowie die Verwendung der im 
^ Verfahren hergestellten ungesattigten Fettsauren bzw. Triglyceride mit einem erhohten Gehalt an ungesattigten Fettsauren. 
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A6-Desaturasegene exprimierende Pflanzen und PUFAS enthaltende 
Ole aus diesen Pflanzen und ein Verfahren zur Herstellung 
ungesattigter Fettsauren 

5 

Beschreibung 

Die vorliegende Erfindung betrifft ein verbessertes Verfahren 
zur Herstellung von ungesattigten Fettsauren sowie ein Verfahren 

10 zur Herstellung von Triglyceriden mit einem erhohten Gehalt an 
ungesattigten Fettsauren. Die Erfindung betrifft die Herstellung 
eines transgenen Organismus ses bevorzugt einer transgenen Pflanze 
oder eines transgenen Mikroorganismus mit erhohtem Gehalt an 
Fettsauren, Olen oder Lipiden mit A6~Doppelbindungen aufgrund 

15 der Expression einer .A-6-Desa.turase aus Moos. 

Au£erdem betrifft die Erfindung transgene Organismen, die ein 
A6-Desaturasegen enthalten, sowie die Verwendung der im Verfahren 
hergestellten ungesattigten Fettsauren bzw. Triglyceride mit 
20 einem erhohten Gehalt an ungesattigten Fettsauren. 

Fettsauren und Triglyceride haben eine Vielzahl von Anwendungen 
in der Lebensmittelindustrie, der Tierernahrung, der Kosmetik und 
im Pharmabereich. Je nachdem ob es sich um freie gesattigte oder 

25 ungesattigte Fettsauren oder um Triglyceride mit einem erhohten 
Gehalt an gesattigten oder ungesattigten Fettsauren handelt, sind 
sie fur die unterschiedlichsten Anwendungen geeignet, so werden 
beispielsweise mehrfach ungesattigte Fettsauren Babynahrung zur 
Erhohung des Nahrwertes zugesetzt. Hauptsachlich werden die ver- 

30 schiedenen Fettsauren und Triglyceride aus Mikroorganismen wie 
Mortierella oder aus 6l-produzierenden Pflanzen wie Soja, Raps, 
Sonnenblume und weiteren gewonnen, wobei sie in der Regel in Form 
ihrer Triacylglyceride anf alien. Sie konnen ai>er auch aus Tieren 
wie Fischen gewonnen werden. Die freien Fettsauren werden vor- 

35 teilhaft durch Verseifung hergestellt. 

Je nach Anwendungszweck sind Ole mit gesattigten oder ungesattig- 
ten Fettsauren bevorzugt, so sind z.B. in der humanen Ernahrung 
Lipide mit ungesattigten Fettsauren speziell mehrfach ungesattig- 

40 ten Fettsauren bevorzugt, da sie einen positiven Einflufc auf den 
Cholesterinspiegel im Blut und damit auf die Moglichkeit einer 
Herzerkrankung haben. Auch eine positive Wirkung auf die Carcino- 
genese wird den -ungesattigten Fettsauren zugeschrieben. Sie 
sind auEerdem wichtige Ausgangsstof f e fur die Synthese von 

45 Verbindungen, die wichtige biologische Vorgange innerhalb des 
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Organismus steuern. Sie finden deshalb in verschiedenen 
diatischen Lebensmitteln oder Medikamenten Anwendung. 

Aufgrund ihrer positiven Eigenschaf ten hat es in der Vergangen- 
5 heit nicht an Ansatzen gefeh.lt, Gene, die an der Synthese 

von Fettsauren bzw. Triglyceriden beteiligt sind, fur die Her- 
stellung von Olen in verschiedenen Organismen mit geandertem 
Gehalt an ungesattigten Fettsauren verfugbar zu machen. So 
wird in WO 91/13972 und seinem US-Aquivalent eine A9-Desaturase 

10 beschrieben. In WO 93/11245 wird eine A15-Desaturase in 

WO 94/11516 wird eine A12-Desaturase beansprucht. A6-Desaturasen 
werden in Girke et al . (The Plant Journal , 15, 1998: 39-48), 
Napier et al . (Biochem. J . , 330, 1998: 611-614), Murata et al . 
(Biosynthesis of y-linolenic acid in cyanobacterium Spirulina 

15 patensis, pp 22-32, In: y-linolenic acid, metabolism an its roles 
in nutrition and medicine, Huang, Y . and Milles, D.E. [eds.], AOC 
Press, Champaign, Illinois), Sayanova et al . (Proc. Natl. Acad. 
Sci. USA, 94, 1997: 4211-4216), WO 98/46764, Cho et al . (J. Biol. 
Chem. , 274, 1999: 471-477), Aki et al . {Biochem. Biophys. Res. 

20 Commun., 255, 1999: 575-579), und Reddy et al . (Plant Mol . Biol., 
27, 1993: 293-300) beschrieben. Weitere Desaturasen werden 
beispielsweise in EP-A-0 550 162, WO 94/18337, WO 97/30582, 
WO 97/21340, WO 95/18222, EP-A-0 794 250, Stukey et al . , J. Biol. 
Chem., 265, 1990: 20144-20149, Wada et al . , Nature 347, 1990: 

25 200-203 Oder Huang et al . , Lipids 34, 1999: 649-659 beschrieben. 
Weitere A 6 -Desaturasen werden in WO 93/06712, US 5,614,393, 
U£5,614,393, WO 96/21022, WO00/21557 und WO 99/27111 beschrieben. 
Die biochemische Charakterisierung der verschiedenen Desaturasen 
ist jedoch bisher nur unzureichend erfolgt, da die Enzyme als 

30 membrangebundene Proteine nur sehr schwer zu isolieren und 
charakterisieren sind (McKeon et al., Methods in Enzymol . 71, 
1981: 12141-12147, Wang et al., Plant Physiol. Biochem., 26, 
1988: 777-792). In der Regel erfolgt die Charakterisierung 
membrangebundener Desaturasen durch Einbringung in einen 

35 geeigneten Organismus, der anschlie£end auf Enzymaktivitat 
mittels Edukt- und Produkt analyse untersucht wird. Die An- 
wendung zur Produktion in transgenen Organismen beschrieben 
wie in WO 98/46763 W098/46764, W098/46765. Dabei wird auch die 
Expression verschiedener Desaturasen wie in W099/ 64616 oder 

40 W098/46776 und Bildung polyungesattigter Fettsauren beschrieben 
und beansprucht. Beziiglich der Effektivitat der Expression von 
Desaturasen und ihren Einflu£ auf die Bildung polyungesattigter 
Fettsauren ist anzumerken, da£ durch Expression einer einzelnen 
Desaturase wie im vorgenannten Stand der Technik beschrieben 

45 lediglich geringe Gehalte an ungesattigten Fettsauren beispiels- 
weise an A-6 ungesattigten Fettsauren/Lipiden wie z.B. y-Linolen- 
saure erreicht wurden und werden. 
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Nach wie vbr besteht daher ein grofier Bedarf an neuen und besser 
geeigneten Genen, die fur Enzyme codieren, die an der Biosynthese 
ungesattigter Fettsauren beteiligt sind und es ermoglichen, diese 
in einem technischen Mafcstab herzustellen. Weiterhin besteht 
5 nach wie vor ein Bedarf an verbesserten Verfahren zur Gewinnung 
moglichst hoher Gehalte an polyungesattigten Fettsauren. 

Es bestand daher die Aufgabe ein Verfahren zur Herstellung 
von ungesattigten Fettsauren unter Verwendung von Genen, die 

10 beispielsweise fur Desaturase-Enzyme codieren und die an der 
Synthese mehrf ach ungesattigter Fettsauren in den Samen einer 
Olsaat beteiligt sind, bereitzustellen und so den Gehalt poly- 
ungesattigter Fettsauren zu erhohen. Diese Aufgabe wurde durch 
ein Verfahren zur Herstellung von ungesattigten Fettsauren 

15 gelost, dadurch gekennzeichnet , daS mindestens eine isolierte 
Nukleinsauresequenz, die fur ein Polypeptid mit A6-Desaturase- 
aktivitat codiert, ausgewahlt aus der Gruppe: 

a) einer Nukleinsauresequenz mit der in SEQ ID NO: 1 dar- 
20 gestellten Seguenz, 

b) Nukleinsauresequenzen, die sich als Ergebnis des degenerier- 
ten genetischen Codes von der in SEQ ID NO: 1 ableiten 

25 c) Derivate der in SEQ ID NO: 1 dargestellten Nukleinsaure- 
sequenz, die' fur Polypeptide mit der in SEQ ID NO: 2 
dargestellten Aminosauresequenzen codieren und mindestens 
50 % Homologie auf Aminosaureebene aufweisen, ohne da£ die 
enzymatische Wirkung der Polypeptide wesentlich reduziert 

30 ist, 

in einen Organismus eingebracht wird, dieser Organismus angezogen 
wird, wobei der angezogene Organismus mindestens 1 Mol-% unge- 
sattigte Fettsauren bezogen auf den gesamten Fettsauregehalt 
35 im Organismus enthalt. 

Unter Anzucht des Organismus ist die Kultivierung von Pflanzen 
ebenso zu verstehen wie die Anzucht von eukaryontischen oder 
prokaryontischen Mikroorganismen wie Bakterien, Hefen, Pilzen, 
40 Ciliaten, Algen, Cyanobakterien, tierischen oder pflanzlichen 
Zellen oder Zellverbanden oder die Anzucht von Tieren. 

Die in den im erf indungsgemaSen Verfahren gewonnenen Organismen 
enthalten in der Regel ungesattigte Fettsauren in Form von 
45 gebundenen Fettsauren, das heiSt die ungesattigten Fettsauren 
liegen uberwiegend in Form ihrer Mono-, Di- oder Triglyceride, 
Glycolipide, Lipoproteine oder Phospholipide wie Ole oder Lipide 
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oder sonstig als Ester oder Amide gebundenen Fettsauren vor . 
Auch freie Fettsauren sind in den Organismen in Form der freien 
Fettsauren oder in Form ihrer Salze enthalten. Die freien oder 
gebundenen ungesattigten Fettsauren enthalten vorteilhaft gegen- 
5 uber den Ausgangsorganismen einen erhohten Gehalt an Fettsauren 
mit A6-Doppelbindungen wie vorteilhaft y-Linolensaure . Die durch 
Anzucht im erf indungsgema£en Verfahren gewonnenen Organismen 
und die in ihnen enthaltenen ungesattigten Fettsauren konnen 
direkt beispielsweise zur Hersteliung von pharmazeutischen 

10 Zubereitungen, von Agrochemikalien, Futtermitteln oder Lebens- 
mitteln verwendet werden oder aber nach Isolierung aus den 
Organismen. Dabei konnen alle Stufen der Aufreinigung der unge- 
sattigten Fettsauren verwendet werden, das hei&t von Rohextrakten 
der Fettsauren bis zu vollstandig gereinigten Fettsauren sind fur 

15 die Hersteliung der vorgenannten Produkte geeignet . In einer vor- 
teilhaften Ausf iihrungsf orm konnen die gebundenen Fettsauren aus 
beispielsweise den Olen bzw. Lipiden beispielsweise liber eine 
basische Hydrolyse z.B. mit NaOH oder KOH freigesetzt werden. 
Diese freien Fettsauren konnen direkt im erhaltenen Gemisch oder 

20 nach weiterer Aufreinigung zur Hersteliung von pharmazeutischen 
Zubereitungen, von Agrochemikalien , Futtermitteln oder Lebens- 
mitteln verwendet werden. Auch konnen die gebundenen oder freien 
Fettsauren zur Umesterung oder Veresterung beispielsweise mit 
anderen Mono-, Di- oder Triglyceriden oder Glycerin verwendet 

25 werden, um den Anteil an ungesattigten Fettsauren in diesen 
Verbindungen beispielsweise in den Triglyceriden zu erhohen. 

Ein weiterer erf indungsgemaSer Gegenstand ist ein Verfahren zur 
Hersteliung von Triglyceriden mit einem erhohten Gehalt an unge- 

30 sattigten Fettsauren, indem man Triglyceride mit gesattigten oder 
ungesattigten oder gesattigten und ungesattigten Fettsauren mit 
mindestens einem der Protein, das durch die Seguenz SEQ ID NO: 2 
codiert wird, inkubiert. Vorteilhaft wird das Verfahren in Gegen- 
wart von Verbindungen durchgefxihrt , die Reduktionsaquivalente 

35 aufnehmen oder abgeben konnen. AnschlieSend konnen die Fettsauren 
aus den Triglyceriden freigesetzt werden. 

Die oben genannten Verfahren ermoglichen vorteilhaft die Synthese 
von Fettsauren oder gebundenen Fettsauren wie Triglyceriden mit 
40 einem erhohten Gehalt an Fettsauren mit A6-Doppelbindungen. 

Als Organismen fur die genannten Verfahren seien beispielhaf t 
Pflanzen wie Arabidopsis, Gerste, Weizen, Roggen, Hafer, Mais, 
Soja, Reis, Baumwolle, Zuckerriibe, Tee, Karotte, Paprika, Canola, 
45 Sonnenblume, Flachs, Hanf, Kartoffel, Triticale, Tabak, Tomate, 
Raps, Kaffee, Tapioka, Maniok, Pfeilwurz, Tagetes, Alfalfa, 
Erdnufi , Rizinus, Kokosnu£, Olpalme, Farbersaflor (Carthamus 
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tinctorius ) , Salat und den verschiedenen Baum- , Nu£- und Wein- 
spezies, oder Kakaobohne, Mikroorganismen wie Pilze Mortierella, 
Saprole'gnia oder Pythium, Bakterien wie die Gattung Escherichia, 
Cyanobakterien, Algen oder Protozoen wie Dixiof lagellaten wie 
5 Crypthecodinium genannt . Bevorzugt werden Organismen, die 
natiirlicherweise Ole in groSeren Mengen synthetisieren konnen 
wie Mikroorganismen wie Pilze wie Mortierella alpina, Pythium 
insidiosum oder Pflanzen wie Soja, Raps, KokosnuE, Olpalme, 
Canola, Farbersaflor (Carthamus tinctorius), Rizinus, Calendula, 
10 Lein, Borretsch, Erdnu£, Kakaobohne oder Sonnenblume, besonders 
bevorzugt werden Soja, Raps oder Sonnenblume. 

Die in den Verfahren verwendeten Organismen werden je nach 
Wirtsorganismus in dem Fachmann bekannter Weise angezogen bzw. 

15 geziichtet. Mikroorganismen wie Bakterien, Pilze, Ciliaten, 

pflanzliche oder tierische Zellen werden in der Regel in einem 
fliissigen Medium, das eine Kohlenstoff quelle meist in Form von 
Zuckern, eine Sticks toff quelle meist in Form von organischen 
Stickstof fguellen wie Hefeextrakt oder Salzen wie Ammoniumsulf at , 

20 Spurenel entente wie Eisen-, Mangan-, Magnesiumsalze und gegebenen- 
falls Vitamine enthalt, bei Temperaturen zwischen 0°C und 100°C, 
bevorzugt zwischen 10°C bis 60°C unter je nach Organismus Sauer- 
stof fbegasung oder in Abwesenheit von Sauerstoff angezogen. Dabei 
kann der pH der Nahrf lussigkeit auf einen festen Wert gehalten 

25 werden, das hei£t der pH wird wahrend der Anzucht regnliert 
oder der pH wird nicht reguliert und verandert sich wahrend 
der Anzucht. Die Anzucht kann batch weise, semi batch weise 
oder kontinuierlich erfolgen. Nahrstoffe konnen zu beginn der 
Fermentation vorgelegt oder semikontinuierlich oder kontinuier- 

30 lich nach gefuttert werden. Auch eine Anzucht auf festen Medien 
ist moglich. 

Pflanzen werden nach Transformation in der Regel zunachst 
regeneriert und anschlieSend wie liblich angezogen bzw. angebaut. 
35 Dies kann im Gewachshaus oder im Freiland erfolgen. 

Aus den Organismen werden nach Anzucht die Lipide in ublicher- 
weise gewonnen. Hierzu konnen die Organismen nach Ernte zunachst 
auf geschlossen werden oder direkt verwendet werden. Die Lipide 

40 werden vorteilhaft mit geeigneten Losungsmitteln wie apolare 

Losungsmittel wie Hexan oder Ethanol, Isopropanol oder Gemischen 
wie Hexan/ Isopropanol, Phenol /Chloroform/ Isoamylalkohol bei 
Temperaturen zwischen 0°C bis 80°C, bevorzugt zwischen 20°C bis 
50°C extrahiert. Die Biomasse wird in der Regel mit einem Uber- 

45 schuE an Losungsmittel extrahiert beispielsweise einem UberschuS 
von Losungsmittel zu Biomasse von 1:4. Das Losungsmittel wird 
anschliefiend beispielsweise liber eine Destination entfernt. 
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Die Extraktion kann auch mit superkritischem CO2 erfolgen. 
Nach Extraktion kann die restliche Biomasse beispielsweise 
uber Filtration entfernt we r den. 

5 Das so gewonnene Rohol kann anschlie£end weiter aufgereinigt 
werden, beispielsweise in dem Triibungen uber das Versetzen 
mit polaren Losungsmittel wie Aceton oder Chloroform und 
anschlie&ender Filtration oder Zentrif ligation entfernt werden. 
Auch eine weitere Reinigung uber chromatographische Verfahren, 
10 Destination oder Kristallisation ist moglich. 

Zur Gewinnung der freien Fettsauren aus den Triglyceriden werden 
diese in ublicher Weise, wie oben beschrieben, verseift. 

15 Ein weiterer Gegenstand der Erfindung sind ungesattigte Fett- 
sauren sowie Trigylceride mit einem erhohten Gehalt an unge- 
sattigten Fettsauren, die nach den oben genannten Verfahren 
hergestellt wurden, sowie deren Verwendung zur Herstellung von 
Nahrnngsmitteln, Tierfutter, Kosmetika oder Pharmazeutika. Hierzu 

20 werden diese den Nahrungsmitteln, dem Tierfutter, den Kosmetika 
oder Pharmazeutika in tiblichen Mengen zugesetzt. 

Im erf indungsgemaSen Verfahren warden durch Expression einer 
A6-Desaturase aus Moos in Organismen wie Pilze, Bakterien, 

25 Tieren oder Pf lanzen, bevorzugt Pilzen, Bakterien und Pflanzen, 
besonders bevorzugt in Pflanzen, ganz besonders bevorzugt in 
Olfruchtpf lanzen wie Raps, Canola, Lein, Soja, Sonnenblume, 
Borretsch, Rizinus, Olpalme, Farbersaflor (Carthamus tinctorius), 
KokosnuS, ErdnuS oder Kakaobohne hohere Gehalte an ungesattigten 

30 Fettsauren wie y-Linolensaure erhalten. Auch die Expression in 
Feldf riichten, wie Mais, Weizen, Roggen, Hafer, Triticale, Reis, 
Gerste, Alfalfa, oder Buschpf lanzen (Kaffee, Kakao, Tee) ist 
vorteilhaft. Durch die Expression eines Gens, das fur eine 
A-6-Desaturase aus Moos codiert, in den oben genannten Organismen 

35 konnen Gehalte an ungesattigten Fettsauren in den Organismen von 
mindestens 1 Mol-%, bevorzugt mindestens 3 Mol-%, besonders be- 
vorzugt mindestens 4 Mol-%, ganz besonders bevorzugt mindestens 
5 Mol-% erreicht werden. 

40 Unter Derivate(n) sind beispielsweise funktionelle Homologe 

der von SEQ ID NO: 1 codierten Enzyme oder deren enzymatischer 
Aktivitat, das hei£t Enzyme, die dieselben enzymatischen 
Reaktionen wie die von SEQ ID NO: 1 katalysieren, zu verstehen. 
Diese Gene ermoglichen ebenfalls eine vorteilhafte Herstellung 

45 von ungesattigten Fettsauren mit Doppeibindungen in A6-Position. 
Unter ungesattigten Fettsauren sind im folgenden doppelt oder 
mehrfach ungesattigte Fettsauren, die Doppeibindungen aufweisen, 
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zu verstehen. Die Doppelbindungen konnen konjugiert oder nicht 
konjugiert sein. Die in SEQ ID NO: 1 genannte Sequenz codiert 
fur ein Enzym, das eine A6-Desaturase-Aktivitat aufweist. 

5 Das erf indungsgema&e Enzym A6-Desaturase fuhrt vorteilhaft in 
Fettsaurereste von Glycerolipiden eine cis-Doppelbindung in 
Position C 6 -C 7 ein (siehe SEQ ID NO: 1). Das Enzym hat au£erdem 
eine A6-Desaturase-Aktivitat , die vorteilhaft in Fettsaurereste 
von Glycerolipiden ausschlieSlich eine ci s-Doppelbindung in 
10 Position C 6 -C 7 einfuhrt. Diese Aktivitat hat auch das Enzym mit 
der in SEQ ID NO: 1 genannten Seqttenz, bei dem es sich um eine 
monofunktionelle A6-Desaturase handelt. 

Die im erf indungsgema&en Verfahren verwendeten Nukleinsaure- 
15 sequenz(en) (fur die Anmeldung soli der singular den plural 

umfassen und umgekehrt) oder Fragmente davon konnen vorteilhaft 
zur Isoiierung weiterer genomischer Sequenzen xiber Homologies 
screening verwendet werden . 

20 Die genannten Derivate lassen sich beispielsweise aus anderen 
Organismen eukaryontischen Organismen wie Pflanzen wie speziell 
Moosen, Dinof lagellaten oder Pilze isolieren. 

Weiterhin sind unter Derivaten bzw. f unktionellen Derivaten der 

25 in SEQ ID NO: 1 genannten Sequenz beispielsweise Allelvarianten 
zu verstehen, die mindestens 50 % Homologie auf der abgeleiteten 
Aminosaureebene, vorteilhaft mindestens 70 % Homologie, bevorzugt 
mindestens 80 % Homologie, besonders bevorzugt mindestens 85 % 
Homologie, ganz besonders bevorzugt 90 % Homologie aufweisen. 

30 Die Homologie wurde uber den gesamten Aminosaurebereich be- 

rechnet. Es wurde das Programm PileUp, BESTFIT, GAP, TRANSLATE 
bzw. BACKTRANSLATE (= Bestandteil des Programmpaketes UWGCG, 
Wisconsin Package, Version 10.0-UNIX, January 1999, Genetics 
Computer Group, Inc., Deverux et al , Nucleic. Acid Res., 12, 

35 1984: 387-395) verwendet (J. Mol . Evolution., 25, 351-360, 1987, 
Higgins et al . , CABIOS, 5 1989: 151-153). Die von den genannten 
Nukleinsauren abgeleitete Aminos aure sequenz ist Seguenz 
SEQ ID NO: 2 zu entnehmen. Unter Homologie ist Identitat zu 
verstehen, das hei&t die Aminosauresequenzen sind zu mindestens 

40 50 % identisch. Die erf indungsgema&en Sequenzen sind auf Nuklein- 
saureebene mindestens 65 % homolog, bevorzugt mindestens 7 0 %, 
besonders bevorzugt 75 %, ganz besonders bevorzugt mindestens 
80 %. 

45 Allelvarianten umfassen insbesondere funktionelle Varianten, die 
durch Deletion, Insertion oder Slabs titut ion von Nukleotiden aus 
der in SEQ ID NO: 1 dargestellten Sequenz erhaltlich sind, wobei 
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die enzymatische' Aktivitat der abgeleiteten synthetisierten 
Proteine erhalten bleibt. 

Solche DNA-Sequenzen lassen sich ausgehend von der in 
5 SEQ ID NO: 1 beschriebenen DNA-Seqruenz oder Teilen dieser 

Seguenzen, beispielsweise mit ublichen Hybridisierungsverf ahren 
Oder der PCR-Technik aus anderen Eukaryonten wie beispielsweise 
den oben genannt isolieren. Diese DNA-Sequenzen hybridisieren 
unter Standardbedingungen mit den genannten Seguenzen . Zur 

10 Hybridisierung werden vorteilhaft kurze Oligonukleotide bei- 
spielsweise der konservierten Bereiche, die uber Vergleiche mit 
anderen Desaturasegenen in dem Fachmann bekannter Weise ermittelt 
werden konnen, verwendet . Vorteilhaft werden die Histidin-Box- 
Seguenzen verwendet. Es konnen aber auch langere Fragmente der 

15 erfindungsgemafcen Nukleinsauren oder die vollstandigen Seguenzen 
fur die Hybridisierung verwendet werden. Je nach der verwendeten 
Nukleinsaure: Oligonukleotid, langeres Fragment oder vollstandige 
Secjuenz oder je nachdem welche Nukleinsaureart DNA oder RNA fur 
die Hybridisierung verwendet werden, variieren diese Standard- 

20 bedingungen. So liegen beispielsweise die Schmelztemperaturen fur 
DNA : DNA-Hybr ide ca. 10 °C niedriger als die von DNArRNA-Hybriden 
gleicher Lange. 

Unter Standardbedingungen sind beispielsweise je nach Nuklein- 
25 saure Temperaturen zwischen 42 und 58°C in einer wa£rigen Puffer- 
losung mit einer Konzentration zwischen 0,1 bis 5 x SSC (1 X SSC 
= 0,15 M NaCI, 15 niM Natriumcitrat , pH 7,2) oder zusatzlich in 
Gegenwart von 50 % Formamid wie beispielsweise 42°C in 5 x SSC, 
50 % Formamid zu verstehen. Vorteilhaf terweise liegen die 
30 Hybridisierungsbedingungen fur DNA : DNA-Hybride bei 0,1 x SSC und 
Temperaturen zwischen etwa 20°C bis 45°C, bevorzugt zwischen etwa 
30°C bis 45°C." Fur DNA : RNA— Hybride liegen die Hybridisierungs- 
bedingungen vorteilhaft bei 0,1 x SSC und Temperaturen zwischen 
etwa 30°C bis 55°C, bevorzugt zwischen etwa 45°C bis 55°C. Diese 

35 angegebenen Temperaturen fur die Hybridisierung sind beispielhaf t 
kalkulierte Schmelz temper aturwerte fur eine Nukleinsaure mit 
einer Lange von ca. 100 Nukleotiden und einem G + C-Gehalt von 
50 % in Abwesenheit von Formamid. Die experimentellen Bedingungen 
fur die DNA-Hybridisierung sind in einschlagigen Lehrbuchern der 

40 Genetik wie beispielsweise Sambrook et al . , "Molecular Cloning", 
Cold Spring Harbor Laboratory, 1989, beschrieben und lassen sich 
nach dem Fachmann bekannten Formeln beispielsweise abhangig von 
der Lange der Nukleinsauren, der Art der Hybride oder dem G + 
C-Gehalt berechnen. Weitere Inf ormationen zur Hybridisierung kann 

45 der Fachmann folgenden Lehrbuchern entnehmen: Ausubel et al . 

(eds), 1985, Current Protocols in Molecular Biology, John Wiley & 
Sons, New York; Hames and Higgins (eds), 1985, Nucleic Acids 
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Hybridization: A Practical Approach, IRL Press at Oxford 
University Press, Oxford; Brown (ed), 1991, Essential Molecular 
Biology: A Practical Approach, IRL Press at Oxford University 
Press, Oxford. 

5 

Weiterhin sind unter Derivaten Homologe der Sequenz SEQ ID No: 1 
beispielsweise eukaryontische Homologe, verkurzte Sequenzen, Ein- 
zelstrang-DNA der codierenden und nichtcodierenden DNA-Sequenz 
oder RNA der codierenden und nichtcodierenden DNA-Sequenz zu 
10 verstehen. 

AuSerdem sind unter Homologen der Sequenz SEQ ID NO: 1 Derivate 
wie beispielsweise Promotorvarianten zu verstehen. Diese Varian- 
ten konnen durch ein oder mehrere Nukleotidaustausche , durch 
15 Insertion (en) und/ oder Deletion (en) verandert sein, ohne daS aber 
die Funktionalitkt bzw. Wirksamkeit der Promotoren beeintrachtigt 
sind. Des weiteren konnen die Promotoren durch Veranderung ihrer 
Sequenz in ihrer Wirksamkeit erhoht oder komplett durch wirk- 
samere Promotoren auch artfremder Organismen ausgetauscht werden. 

20 

Unter Derivaten sind auch vorteilhaft Varianten zu verstehen, 
deren Nukleotidsequenz im Bereich -1 bis -2000 vor dem Startcodon 
so verandert wurden, da£ die Genexpression und/ oder die Protein- 
expression verandert, bevorzugt erhoht wird. Weiterhin sind unter 
25 Derivaten auch Varianten zu verstehen, die am 3 ' -Ende verandert 
wurden . 

Die Nukleinsauresequenzen, die fur eine A6-Desaturase codiert, 
konnen synthetisch hergestellt oder naturlich gewonnen sein oder 

30 eine Mischung aus synthetischen und natiirlichen DNA-Bestandteilen 
enthalten, sowie aus verschiedenen heterologen A6-Desaturase-Gen- 
abschnitten verschiedener Organismen bestehen. Im allgemeinen 
werden synthetische Nukleotid-Sequenzen mit Codons erzeugt, die 
von den entsprechenden Wirtsorganismen beispielsweise Pflanzen 

35 bevorzugt werden. Dies fuhrt in der Regel zu einer optimalen 
Expression der heterologen Gene. Diese von Pflanzen bevorzugten 
Codons konnen aus Codons mit der hochsten Proteinhauf igkeit 
bestimmt werden, die in den meisten interessanten Pflanzen- 
spezies exprimiert werden. Ein Beispiel fur Corynebacterium 

40 glutamicum ist gegeben in: Wada et al . (1992) Nucleic Acids Res. 
20:2111-2118) . Die Durchfuhrung solcher Experimente sind mit 
Hilfe von Standardmethoden durchfuhrbar und sind dem Fachmann 
auf dem Gebiet bekannt. 

45 Funktionell aquivalente Sequenzen, die fur das A6-Desaturase-Gen 
codieren, sind solche Derivate der erf indungsgema&en Sequenz, 
welche trotz abweichender Nukleotidseguenz noch die gewunschten 
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Funktionen, das heiSt die enzymatische Aktivitat der Proteine 
besitzen. Funktionelle Aquivalente umfassen somit naturlich 
vorkoromende Varianten der hierin beschriebenen Seguenzen sowie 
kiinstliche, z.B. durch chemische Synthese erhaltene, axi den 
5 Codon-Gebrauch einer Pflanze angepa£te, kiinstliche Nukleotid- 
Sequenzen . 

AuSerdem sind artif izielle DNA-Seguenzen geeignet, solange sie, 
wie oben beschrieben, die gewiinschte Eigenschaft beispiels- 

10 weise der Erhohung des Gehaltes von A6-Doppelbindungen in Fett- 
sauren, Olen oder Lipiden in der Pflanze durch Uberexpression 
des A6-Desaturase-Gens in Kulturpf lanzen vermitteln. Solche 
artif iziellen DNA-Sequenzen konnen beispieisweise durch Riick- 
iibersetzung mittels Molecular Modelling konstruierter Proteine, 

15 die A6-Desaturase-Aktivitat aufweisen oder durch in vitro- 
Selektion ermittelt werden. Mogliche Techniken zur in vitro- 
Evolution von DNA zur Veranderung bzw. Verbesserung der DNA— 
Sequenzen sind beschrieben bei Patten, P. A. et al., Current 
Opinion in Biotechnology 8, 724-733 ( 1997) oder bei Moore, J.C. 

20 et al., Journal of Molecular Biology 272, 336-347 (1997). 
Besonders geeignet sind codierende DNA-Sequenzen, die durch 
Ruckubersetzung einer Polypeptidsequenz gemaS der fur die 
Wirtspflanze spezifischen odon-Nutzung erhalten werden. 
Die spezifische Codon-Nutzung kann ein mit pf lanzengenetischen 

25 Methoden vertrauter Fachmann durch Computerauswertungen anderer, 
bekaimter Gene der zu trans formierenden Pflanze leicht ermitteln. 

Als weitere geeignete aquivalente Nukleinsaure-Sequenzen sind 
zu nennen Sequenzen, welche fur Fusionsproteine codieren, wobei 

30 Bestazidteil des Fusionsproteins ein A6-Desaturase-Polypeptid 
oder ein funktionell aquival enter Teil davon ist. Der zweite* 
Teil des Fusionsproteins kann z.B. ein weiteres Polypeptid mit 
enzymatischer Aktivitat sein oder eine antigene Polypeptidsequenz 
mit deren Hilf e ein Nachweis auf A6-Desaturase-Expression mog- 

35 lich ist (z.B. myc-tag oder his-tag) . Bevorzugt handelt es sich 
dabei jedoch um eine regulative Proteins equenz , wie z.B. ein 
Signalsequenz fiir das ER, das das A6-Desaturase-Protein an den 
gewiinschten Wirkort leitet . 

40 Vorteilhaf t konnen die A6-Desaturase-Gene im erf indungsgemaSen 
Verf ahren mit weiteren Genen der Fettsaurebiosynthese kombiniert 
werden. Beispiele fiir derartige Gene sind die Acetyltransf erasen, 
weitere Desaturasen oder Elongasen ungesattigter oder gesattigter 
Fettsauren wie in WO 00/12720 beschrieben. Fiir die in-vivo und 

45 speziell in-vitro Synthese ist die Kombination mit z.B. NADH- 
Cytochrom B5 Reduktasen vorteilhaf t, die Reduktionsaquivalente 
aufnehmen oder abgeben konnen. 
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Unter den im erf indungsgema£en Verfahren verwendeten Proteine 
sind Proteine zu verstehen, die eine in der Seguenz SEQ ID NO: 2 
dargestellte Aminosaureseguenz oder eine daraus durch Substi- 
tution, Inversion, Insertion oder Deletion von einem oder 
5 mehreren Aminosaureresten erhaltliche Seguenz en thai ten, wobei 
die enzymatische Aktivitat des in SEQ ID NO: 2 dargestellten 
Proteins erhalten bleibt bzw. nicht wesentiich reduziert wird. 
Unter nicht wesentiich reduziert sind alle Enzyme zu verstehen, 
die noch mindestens 10 %, bevorzugt 20 %, besonders bevorzugt 

10 30 % der enzymatischen Aktivitat des Ausgangsenzyms aufweisen. 
Dabei konnen beispielsweise bestimmte Aminosauren durch solche 
mit ahnlichen physikochemischen Eigenschaf ten (Raumerf tillung, 
Basizitcit, Hydrophobizitat etc.) ersetzt werden . Beispielsweise 
werden Argininreste gegen Lysinreste, Valinreste gegen Isoleucin- 

15 reste oder Asparaginsaurereste gegen Glutaminsaurereste aus- 
getauscht . Es konnen aber auch ein oder mehrere Aminosauren in 
ihrer Reihenfolge vertauscht, hinzugefiigt oder entfernt werden, 
oder es konnen mehrere dieser Ma£nahirten miteinander kombiniert 
werden . 

20 

Unter Derivaten sind auch funktionelle Aguivalente zu verstehen, 
die insbesondere auch natiirliche oder kunstliche Mutationen einer 
urspriinglich isolierten fur A6-Desaturase codierende Seguenz 
beinhalten, welche weiterhin die gewiinschte Funktion zeigen, das 

25 hei£t das deren enzymatische Aktivitat nicht wesentiich reduziert 
ist. Mutationen umfassen Substitutionen, Additionen, Deletionen, 
Vertauschungen oder Insertionen eines oder mehrerer Nukleotid- 
reste. Somit werden beispielsweise auch solche Nukleotidseguenzen 
durch die vorliegende Erfindung mit umfa£t, welche man durch 

30 Modif ikation der A6-Desaturase Nukleotidseguenz erhalt. Ziel 
einer solchen Modifikation kann z.B. die weitere Eingrenzung 
der darin enthaltenen codierenden Seguenz oder z.B. auch die 
Einfiigung weiterer Restriktionsenzym-Schnittstellen sein. 

35 Funktionelle Aguivalente sind auch solche Variant en, deren 
Funktion, verglichen mit dem Ausgangsgen bzw. Genfragment, 
abgeschwacht (= nicht wesentiich reduziert) oder verstarkt ist 
(= Enzymaktivitat ist starker als die Aktivitat des Ausgangs- 
enzym, das hei£t Aktivitat ist hoher als 100 %, bevorzugt hoher 

40 als 110 %, besonders bevorzugt hoher als 130 %) . 

Die im erf indungsgema&en Verfahren verwendeten oben genannten 
Nukleinsaureseguenzen werden vorteilhaft zum Einbringen in 
einen Wirtsorganismus in eine Expressionskassette inseriert. 
45 Die Nukleinsaureseguenzen konnen jedoch auch direkt in den Wirts- 
organismus eingebracht werden. Die Nukleinsaureseguenz kann dabei 
vorteilhaft beispielsweise eine DNA- oder cDNA-Seguenz sein. 
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Zur Insertion in eine Expressionskassette geeignete codierende 
Sequenzen sind beispielsweise solche, die fur eine A6-Desaturase 
mit deh oben beschxiebenen Sequenzen codieren und die dem Wirt 
die Fahigkeit zur Uberproduktion von Fettsauren, Olen oder 
5 Lipiden mit Doppelbindungen in A6-Position verleihen. Diese 
Sequenzen konnen homologen oder heterologen Ursprungs sein. 

Unter einer Expressionskassette (= Nukleinsaurekonstrukt oder 
-fragment) ist die in SEQ ID NO: 1 genannte Sequenz , die sich 

10 als Ergebnis des genetischen Codes und/ oder deren f unktionellen 
oder nicht f unktionellen Derivate zu verstehen, die mit einem 
oder mehreren Regulationssignalen vorteilhaf terweise zur Erhohung 
der Genexpression funktionell verkniipft wurden und welche die 
Expression der codierenden Sequenz in der Wirtszelle steuern. 

15 Diese regulatori^chen Sequenzen sollen die gezielte Expression 
der Gene und der Proteinexpression ermoglichen. Dies kann bei- 
spielsweise je nach Wirtsorganismus bedeuten, da£ das Gen erst 
nach Induktion exprimiert und/oder iiberexprimiert wird, oder da£ 
es sofort exprimiert und/oder iiberexprimiert wird. Beispielsweise 

20 handelt es sich bei diesen regulatorischen Sequenzen urn Sequenzen 
an die Induktoren oder Repressoren binden und so die Expression 
der Nukleinsaure regulieren. Zusatzlich zu diesen neuen Regu- 
lationssequenzen oder anstelle dieser Sequenzen kann die natiir- 
liche Regulation dieser Sequenzen vor den eigentlichen Struktur- 

25 genen noch vorhanden sein und gegebenenf alls genetisch verandert 
worden sein, so daS die natiirliche Regulation ausgeschaltet und 
die Expression der Gene erhoht wurde. Das Genkonstrukt kann aber 
auch einfacher aufgebaut sein, das hei£t es wurden keine zusatz- 
lichen Regulationssignale vor die Nukleinsauresequenz oder dessen 

30 Derivate inseriert und der naturliche Promotor mit seiner Regu- 
lation wurde nicht entfernt. Stattdessen wurde die naturliche 
Regulations sequenz so mutiert, daS keine Regulation mehr erfolgt 
und/oder die Genexpression gesteigert wird. Diese veranderten 
Promotoren konnen in Form von Teilsequenzen (= Promotor mit 

35 Teilen der erf indungsgemafcen Nukleinsauresequenzen) auch allein 
vor das naturliche Gen zur Steigerung der Aktivitat gebracht 
werden. Das Genkonstrukt kann au£erdem vorteilhaf terweise auch 
eine oder mehrere sogenannte w enhancer Sequenzen" funktionell 
verknupft mit dem Promotor enthalten, die eine erhohte Expression 

40 der Nukleinsauresequenz ermoglichen. Auch am 3 ' -Ende der DNA- 
Sequenzen konnen zusatzliche vorteilhafte Sequenzen inseriert 
werden wie weitere regulatorische Elemente oder Terminatoren. 
Das A6-Desaturase-Gen kann in einer oder mehreren Kopien in 
der Expressionskassette {= Genkonstrukt) enthalten sein. Auch 

45 eventuell mit exprimierte Gene, die vorteilhaf t an der Fettsaure- 
biosynthese beteiligt sind, konnen in einer oder mehreren Kopien 
in der Expressionskassette vorhanden sein. 
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Die regulatorischen Seguenzen bzw. Faktoren konnen dabei wie 
oben beschrieben vorzugsweise die Genexpression der eingefuhrten 
Gene positiv beeinf lussen und dadurch erhohen. So kann eine Ver- 
starkung der regulatorischen Elemente vorteilhaf terweise auf der 
5 Transkriptionsebene erfolgen, indem starke Transkriptionssignale 
wie Promotoren und/ oder " Enhancer" verwendet werden. Daneben ist 
aber auch eine Verstarkung der Translation moglich, indent bei- 
spielsweise die Stabilitat der mRNA verbessert wird. 

10 Als Promotoren in der Expressionskassette sind grundsatzlich 
alle Promotoren geeignet, die die Expression von Fremdgenen in 
Organ i smen vorteilhaf t in Pflanzen oder Pilzen steuern konnen. 
Vorzugsweise verwendet man insbesondere einen pflanzlichen 
Promotor oder Promotoren, die beispielsweise aus einem Pflanzen- 

15 virus entstammen. Vorteilhaf te Regulationsseguenzen fur das 

erf indungsgema&e Verfahren sind beispielsweise in Promotoren wie 
cos-, tac- # trp-, tet-, trp- tet-, Ipp-, lac-, Ipp-lac-, lad*-* 
T7-, T5-, T3-, gal-, trc-, ara-, SP6-, ?L-Pr- oder im ?i-P L -Promotor 
enthalten, die vorteilhaf terweise in gram-negativen Bakterien 

20 Anwendung finden. Weitere vorteilhaf te Reguiationssequenzen sind 
beispielsweise in den gram-positiven Promotoren amy und SP02 , in 
den Hefe- oder Pilzpromotoren ADC1, MFa, AC, P-60, CYC1, GAPDH, 
TEF, rp28, ADH oder in den Pf lanzenpromotoren wie CaMV/35S 
[Franck et al . , Cell 21(1980) 285-294], RUBISCO SSU, OCS, B33, 

25 nos ( = Nopalin Synthase Promotor) oder im Ubiguitin-Promotor 
enthalten. Die Expressionskassette kann auch einen chemisch 
induzierbaren Promotor enthalten, durch den die Expression des 
exogenen A6-Desaturase-Gens in den Organismen vorteilhaf t in den 
Pflanzen zu einem bestimmten Zeitpunkt gesteuert werden kann. 

30 Derartige vorteilhaf te Pf lanzenpromotoren sind beispielsweise 
der PRPl-Promotor [Ward et al . , Plant. Mol. Biol. 22 (1993), 
361-366], ein durch Benzensulf onamid-induzierbarer (EP 388186), 
ein durch Tetrazyklin-induzierbarer (Gatz et al . , (1992) Plant 
J. 2,397-404), ein durch Salizylsaure induzierbarer Promotor 

35 (WO 95/19443), ein durch Abscisinsaure-induzierbarer (EP335528) 
bzw. ein durch Ethanol- oder Cyclohexanon-induzierbarer 
(WO 93/21334) Promotor. Weitere Pf lanzenpromotoren sind bei- 
spielsweise der Promotor der cytosolischen FBPase aus Kartoffel, 
der ST-LSI Promotor aus Kartoffel (Stockhaus et al., EMBO J. 

40 8 (1989) 2445-245), der Promotor der Phosphoribosylpyrophosphat 
Amidotransf erase aus Glycine max (siehe auch Genbank Accession 
Nummer U87999) oder ein Nodien-spezif ischen Promotor wie in 
EP 24967 6 konnen vorteilhaf t verwandt werden. Vorteilhaf t sind 
insbesondere solche pflanzliche Promotoren, die die Expression in 

45 Geweben oder Pf lanzenteilen/-organen sicherstellen, in denen die 
Fettsaurebiosynthese bzw. dessen Vorstufen stattfindet wie bei- 
spielsweise im Endosperm oder im sich entwickelnden Embryo. Ins- 
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besondere zu nennen sind vorteilhafte Promo tor en, die eine samen- 
spezif ische Expression gewahrleisten wie beispieisweise der USP- 
Promotor oder Derivate davon, der LEB4 -Promotor , der Phaseolin- 
Promotor oder der Napin-Promotor . Der erf indungsgema£ aufgeftihrte 
5 und besonders vorteilhafte USP-Promotor oder dessen Derivate ver- 
mitteln in der Samenentwicklung eine sehr friih Genexpression 
(Baeumlein et al., Mol Gen Genet, 1991, 225 (3): 459-67). Weitere 
vorteilhafte samenspezif ische Promotoren, die fur monokotyle und 
dikotyle Pflanzen verwendet werden konnen, sind die fur Dikotyle 

10 geeignete Promotoren wie ebenfalls beispielhaft ausgefiihrte 

Napingen- Promotor aus Raps (US5 , 608 , 152 ) , der Oleosin-Promotor 
aus Arabidopsis (W098/45461 ) , der Phaseolin-Promotor aus 
Phaseolus vulgaris (US5 , 504 , 200) , der Bce4-Promotor aus Brassica 
(W091/ 13980) oder der Leguminosen B4-Promotor (LeB4, Baeumlein 

15 et al., Plant J. , 2, 2, 1992: 233 - 239) oder fiir Monokotyle 

geeignete Promotoren wie die Promotoren die Promotoren des lpt2- 
oder lptl-Gens aus Gerste (W095/15389 und WO95/23230) oder die 
Promotoren des Gersten Hordein-Gens , des Reis Glutei in-Gens , des 
Reis Oryzin-Gens, des Reis Prolamin-Gens , des Weizen Gliadin- 

20 Gens, des Weizen Glutelin-Gens , des Mais Zein-Gens, des Hafer 

Glutelin-Gens , des Sorghum Kasirin-Gens oder des Roggen Secalin- 
Gens, die in WO99/16890 beschrieben werden. 

Weiterhin sind insbesondere solche Promotoren bevorzugt, die 
25 die Expression in Geweben oder Pf lanzenteilen sicherstellen, in 
denen beispieisweise die Biosynthese von Fettsauren, Olen und 
Lipiden bzw. deren Vorstufen stattfindet. Insbesondere zu nennen 
sind Promotoren, die eine samenspezif ische Expression gewahr- 
leisten. Zu nennen sind der Promotor des Napin-Gens aus Raps 
30 (US 5,608,152), des USP-Promotor aus Vicia faba (USP=unbekanntes 
Samenprotein, Baeumlein et al . , Mol Gen Genet, 1991, 225 (3): 
459-67), des Oleosin-Gens aus Arabidopsis (W098/45461 ) , des 
Phaseolin-Promotors (US 5,504,200) oder der Promotor des Legumin 
B4-Gens (LeB4; Baeumlein et al . , 1992, Plant Journal, 2 (2): 
35 233-9) . Weiterhin sind zu nennen Promotoren, wie der des lpt2 
oder lptl-Gens aus Gerste (W095/15389 und WO95/23230) , die in 
monokotylen Pflanzen samenspezif ische Expression vermitt'eln. 

In der Expressionskassette (= Genkonstrukt , Nukleinsaurekon- 
40 strukt) konnen wie oben beschrieben noch weitere Gene, die in 
die Organismen eingebracht werden sollen, enthalten sein. Diese 
Gene konnen unter getrennter Regulation oder unter der gleichen 
Regulationsregion wie das A6-Desaturase-Gen liegen. Bei diesen 
Genen handelt es sich beispieisweise um weitere Biosynthesegene 
45 vorteilhaft der Fettsaurebiosynthese, die eine gesteigerte 

Synthese ermoglichen. Beispieisweise seien die Gene fiir die A15-, 
A12-, A9-, A5-, A4-Desaturase, die verschiedenen Hydroxylasen, 
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die Acyl-ACP-Thioesterasen, f}-Ketoacyl-Synthasen oder j3-Ketoacyl- - 
Reductasen genannt . Vorteilhaft werden die Desaturasegene im 
Nukleihsaurekonstrukt verwendet. 

5 Prinzipiell konnen alle naturlichen Promotoren mit ihren Regu- 
lationssequenzen wie die oben genaimten fur die erf indungsgemaSe 
Expressionskassette und das erf indungsgema£e Verfahren, wie 
unten beschrieben, verwendet werden. Dariiberhinaus konnen auch 
synthetische Promotoren vorteilhaft verwendet werden. 

10 

Es konnen verschiedene DNA-Fragmente manipuliert werden, um 
eine Nukleotid-Sequenz zu erhalten, die zweckmaSigerweise in 
der korrekten Richtung gelesen wird und die mit einem korrekten 
Leseraster ausgestattet ist. Fur die Verbindung der DNA-Fragmente 
15 (= erf indungsgema£e Nukleinsauren) miteinander konnen an die 
Fragmente Adaptoren oder Linker angesetzt werden. 

ZweckmaSigerweise konnen die Promoter- und die Terminator- 
Regionen in Transkriptionsrichtung mit einem Linker oder Poly- 

20 linker, der eine oder mehrere Restriktionsstellen ftir die 

Insertion dieser Sequenz enthalt, versehen werden. In der Regel 
hat der Linker 1 bis 10, meistens 1 bis 8, vorzugsweise 2 bis 
6 Restriktionsstellen. Im allgemeinen hat der Linker innerhalb 
der regulatorischen Bereiche eine Gr6£e von weniger als 100 bp, 

25 haufig weniger als 60 bp, mindestens jedoch 5 bp. Der Promotor 
kann sowohl nativ bzw. homolog als auch fremdartig bzw. hetero- 
log zum Wirtsorganismus beispielsweise zur Wirtspflanze sein. 
Die Expressionskassette beinhaltet in der 5 ' -3 ' -Transkriptions- 
richtung den Promotor, eine DNA- Sequenz , die fur ein im er- 

30 f indungsgemaSen Verfahren verwendetes A6-Desaturase-Gen codiert 
und eine Region fur die transkriptionale Termination. Ver- 
schiedene Terminationsbereiche sind gegeneinander beliebig 
austauschbar . 

35 Ferner konnen Manipulationen, die passende Restriktionsschnitt- 
stellen bereitstellen oder die uberflussige DNA oder Restrik- 
tionsschnittstellen entfernen, eingesetzt werden. Wo Insertionen, 
Deletionen oder Subs ti tut ionen wie z.B. Transitionen und Trans- 
versionen in Frage kommen, konnen In vi tro-Mutagenese , -primer- 

40 repair-, Restriktion oder Ligation verwendet werden. Bei geeig- 
neten Manipulationen, wie z.B. Restriktion, -chewing-back- oder 
Auf fullen von Uberhangen fur -bluntends-, konnen komplementare 
Enden der Fragmente fur die Ligation zur Verfugung gestellt 
werden . 



45 
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Von Bedeutung fur eine vorteilhafte hohe Expression kann u.a. 
das Anhangen des spezifischen ER-Retent ions signals SEKDEL sein 
(Schouten, A. et al., Plant Mbl . Biol. 30 (1996), 781-792), die 
durchschnittliche Express ionshohe wird damit verdreifacht bis 
5 vervierf acht . Es konnen auch andere Retentions s ignale , die natiir- 
licherweise bei im ER lokalisierten pflanzlichen und tierischen 
Proteinen vorkommen, fur den Aufbau der Kassette eingesetzt 
werden . 

10 Bevorzugte Polyadenylierungss ignale sind pflanzliche Poly- 

adenylierungss ignale , vorzugsweise solche, die im wesentlichen 
T-DNA-Polyadenylierungss ignale aus Agrobacterium tumef aciens , 
insbesondere des Gens 3 der T-DNA (Octopin Synthase) des 
Ti-Plasmids pTiACHS entsprechen (Gielen et al., EMBO J . 3 (1984), 

15 835 ff) Oder entsprechende funktionelle Aquivalente. 

Die Herstellung einer Expressionskassette erfolgt durch Fusion 
eines geeigneten Promotors mit einer geeigneten A6-Desaturase- 
DNA-Sequenz sowie einem Polyadenylierungs signal nach gangigen 

20 Rekombinations- und Klonierungstechniken, wie sie beispielsweise 
in T. Maniatis, E.F. Fritsch und J. Sambrook, Molecular Cloning: 
A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring 
Harbor, NY (1989) sowie in T.J. Silhavy, M.L. Berman und L.W. 
Enquist, Experiments with Gene Fusions, Cold Spring Harbor 

25 Laboratory, Cold, Spring Harbor, NY (1984) und in Ausubel, F.M. 

et al.. Current Protocols in Molecular Biology, Greene Publishing 
Assoc. and Wiley- Interscience (1987) beschrieben werden. 

Die DNA Sequenz codierend fur eine A6-Desaturase aus Phsyco- 
30 mitrella patens beinhaltet alle Sequenzmerkmale , die notwendig 
sind, um eine dem Ort der Fettsaure-, Lipid- oder Olbiosynthese 
korrekte Lokalisation zu erreichen. Daher sind keine weiteren 
Targetingseguenzen per se notwendig.. Allerdings kann eine solche 
Lokalisation wunschenswert und vorteilhaft sein und daher kiinst- 
35 lich verandert oder verstarkt werden, sodaS auch solche Fusions- 
konstrukte eine bevorzugte vorteilhafte Ausf uhrungsf orm der 
Erfindung sind. 

Insbesondere bevorzugt sind Sequenzen, die ein Targeting in 
40 Plastiden gewahrleisten . Unter bestimmten Umstanden kann auch 
ein Targeting in andere Kompartimente (referiert: Kermode, Crit. 
Rev. Plant Sci . 15, 4 (1996), 285-423) z.B. in in die Vakuole, 
in das Mitochondrium, in das Endoplasmatische Retikulum (ER) , 
Peroxisomen, Lipidkorper oder durch ein Fehlen entsprechender 
45 operativer Sequenzen ein Verbleib im Kompartiment des Entstehens, 
dem Zytosol, wunschenswert sein. 
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Vorteilhaf terwelse werden die fur A6-Desaturase-Gene codierenden - 
Nukleinsauresequenzen zusammen mit mindestens einem Reportergen 
in eiiie Expressionskassette kloniert, die in den Organismus uber 
einen Vektor oder direkt in das Genom eingebracht wird. Dieses 
5 Reportergen sollte eine leichte Detektierbarkeit iiber einen 

Wachstums-, Fluoreszenz-, Chemo-, Biolumineszenz- oder Resistenz- 
assay oder iiber eine photometrische Messung ermoglichen. Bei- 
spielhaft seien als Reportergene Antibiotika-oder Herbizid- 
resis tenzgene , Hydrolasegene , Fluoreszenzproteingene , Biolumin- 

10 eszenzgene, Zucker- oder Nukleotidstof fwechselgene oder Bio* 
synthesegene wie das Ura3-Gen, das Ilv2-Gen, das Lucif erasegen, 
das ji-Galactosidasegen, das gfp-Gen, das 2-Desoxyglucose-6- 
phosphat-Phosphatasegen, das (5-Glucur oni das e-Gen, fJ-Lactamasegen, 
das Neomycinphosphotransf erasegen, das Hygromycinphosphotrans- 

15 f erasegen oder das BASTA (= Gluphosinatresistenz ) -Gen genannt. 
Diese Gene ermoglichen eine leichte Me£barkeit und Quantif izier- 
barkeit der Transkriptionsaktivitat und" damit der Expression der 
Gene. Damit lassen sich Genomstellen identif izieren, die eine 
unterschiedliche Produktivitat zeigen. 

20 

GemaS einer bevorzugten Ausf iihrungsf orm umfa£t eine Expressions- 
kassette stromaufwarts, d.h. am 5'-Ende der codierenden Sequenz , 
einen Promotor und stromabwarts, d.h. am 3'-Ende, ein Poly- 
adenylierungssignal und gegebenenf alls weitere regulatorische 

25 Elemente, welche mit der dazwischenliegenden codierenden Sequenz 
fur die A6-Desaturase DNA Sequenz operativ verkniipft sind. Unter 
einer operativen Verkmipfung versteht man die seguenzielle 
Anordnung von Promotor, codierender Sequenz, Terminator und ggf . 
weiterer regulativer Elemente derart, da£ jedes der regulativen 

30 Elemente seine Funktion bei der Expression der codierenden 
Sequenz bestimmungsgemaE erfiillen kann. Die zur operativen 
Verkniipfung bevorzugten Sequenzen sind Targeting-Sequenzen zur 
Gewahrleistung der subzellularen Lokalisation in Plastiden. Aber 
auch Targeting-Sequenzen zur Gewahrleistung der subzellularen 

35 Lokalisation im Mitochondrium, im Endoplasmatischen Retikulum 

(= ER) , im Zellkern, in Olkorperchen oder anderen Kompartimenten 
sind bei Bedarf einsetzbar sowie Trans lat ions vers tar ker wie die 
5 '-Fiihrungs sequenz aus dem Tabak-Mosaik-Virus (Gallie et al., 
Nucl. Acids Res. 15 (1987), 8693-8711). 

40 

Eine Expressionskassette kann beispielsweise einen konstitutiven 
Promotor (bevorzugt den USP- oder Napin-Promotor ) , das zu 
exprimierende Gen und das ER-Retentionssignal enthalten. Als 
ER-Retentions signal wird bevorzugt die Aminos auresequenz KDEL 
45 (Lysin, Asparaginsaure , Glutaminsaure, Leucin) verwendet. 
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Die Expressionskassette wird zur Expression in einem pro- 
karyontischen oder eukaryontischen Wirtsorganismus beispielsweise 
einem Mikroorganismus wie einem Pilz oder einer Pflanze vorteil- 
hafterweise in einen Vektor wie beispielsweise einem Plasmid, 
5 einem Phagen oder sonstiger DNA inseriert, der eine optimale 
Expression der Gene im Wirtsorganismus ermoglicht. Geeignete 
Plasmide sind beispielsweise in E. coli pLG338, pACYC184 , pBR- 
Serie wie z.B. pBR322, pUC-Serie wie pUC18 oder pUC19, Mll3mp- 
Serie, pKC30, pRep4 , pHSl, pHS2 , pPLc236, pMBL24, pLG200, pUR290, 

10 pIN-III 113 -Bl, Xgtll oder pBdCI, in Streptomyces pIJlOl, pIJ364, 
pIJ702 oder pIJ361, in Bacillus pUBHO, pC194 oder pBD214, in 
Corynebac cerium pSA77 oder pAJ667, in Pilzen pALSl, pIL2 oder 
pBB116, weitere vorteilhafte Pilzvektoren werden von Romanos, 
M.A. et al., [(1992) "Foreign gene expression in yeast: a 

15 review", Yeast 8: 423-488] und von van den Hondel, C. A. M.J. J. et 
al. [(1991) "Heterologous gene expression in filamentous fungi] 
sowie in More Gene Manipulations in Fungi [J.W. Bennet & L.L. 
Lasure, eds . , p. 396-428: Academic Press: San Diego] und in "Gene 
transfer systems and vector development for filamentous fungi * 

20 [van den Hondel , C .A.M. J. J. & Punt, P.J. (1991) in: Applied 

Molecular Genetics of Fungi, Peberdy, J.F. et al . , eds., p. 1-28, 
Cambridge University Press: Cambridge] beschrieben. Vorteilhafte 
Hefevektoren sind beispielsweise 2jiM, pAG-1, YEp6, YEpl3 oder 
pEMBLYe23 . Beispiele fur Algen- oder Pf lanzenpromotoren sind 

25 pLiGV2 3 , pGHlac + , pBINl9 , pAK2004, pVKH oder pDH51 (siehe Schmidt, 
R. and Willmitzer, L. , 1988). Die oben genannten Vektoren oder 
Derivate der vorstehend genannten Vektoren stellen eine kleine 
Auswahl der moglichen Plasmide dar. Weitere Plasmide sind dem 
Fachmann wohl bekannt und konnen beispielsweise aus dem Buch 

30 Cloning Vectors (Eds. Pouwels P.H. et al. Elsevier, Amsterdam-New 
York-Oxford, 1985, ISBN 0 444 904018) entnommen werden. Geeignete 
pflanzliche Vektoren werden unter anderem in "Methods in Plant 
Molecular Biology and Biotechnology" (CRC Press), Kap. 6/7, 
S. 71-119 beschrieben. Vorteilhafte Vektoren sind sog. shuttle- 

35 Vektoren oder binare Vektoren, die in E. coli und Agrobacterium 
replizieren. 

Unter Vektoren sind au£er Plasmiden auch alle anderen dem 
Fachmann bekannten Vektoren wie beispielsweise Phagen, Viren 
40 wie SV40, CMV, Baculovirus, Adenovirus, Transposons, IS-Elemente, 
Phasmide, Phagemide, Cosmide, lineare oder zirkulare DNA zu 
verstehen. Diese Vektoren konnen autonom im Wirtsorganismus 
repliziert oder chromosomal repliziert werden, bevorzugt ist 
eine chromosomale Replikation. 
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In einer weiteren Ausgestaltungsf orm des Vektors kann die 
erf indungsgemaSe Expressionskassette auch vorteilhaf terweise in 
Form einer linearen DNA in die Organismen eingefuhrt werden und 
xiber heterologe oder homologe Rekombination in das Genom des 
5 Wirtsorganismus integriert werden. Diese lineare DNA kann aus 

einem linearisierten Plasmid oder nur aus der Expressionskassette 
als Vektor oder den erf indungsgemaSen Nukleinsauresequenzen 
bestehen. 



10 In einer weiteren vorteilhaf ten Aus fuhrungs form kann die 
er f indungsgema£e Nukleinsaureseguenz auch alleine in einen 
Organismus eingebracht werden. 

Sollen neben der erf indungsgema£en Nukleinsaureseguenz weitere 
15 Gene in den Organismus eingefuhrt werden, so konnen alle zusammen 
mit einem Report ergen in einem einzigen Vektor oder jedes ein- 
zelne Gen mit einem Reportergen in je einem Vektor oder mehrere 
Gene zusammen in verschiedenen Vektoren in den Organismus ein- 
gebracht werden, wobei die verschiedenen Vektoren gleichzeitig 
20 oder sukzessive eingebracht werden konnen. 

Der Vektor enthalt vorteilhaft mindestens eine Kopie der Nuklein- 
sauresequenzen, die fur eine A6-Desaturase codieren, und/oder der 
Expressionskassette . 

25 

Beispielhaft kann die pflanzliche Expressionskassette in den 
Transformationsvektor pRT ((a) Toepfer et al., 1993, Methods 
Enzymol., 217: 66-78; (b) Toepfer et al. 1987, Nucl. Acids. 
Res. 15: 5890 ff.) eingebaut werden. 

30 

Alternativ kann ein* rekombinanter. Vektor (= Expressionsvektor ) 
auch in-vitro transkribiert und translatiert werden, z.B. durch 
Nutzung des T7 Promotors und der T7 RNA Polymerase. 

35 In Prokaryoten verwendete Expressionsvektoren nutzen haufig 

induzierbare Systeme mit und ohne Fusionsproteinen bzw Fusions- 
oligopeptiden, wobei diese Fusionen sowohl N- terminal als auch 
C-terminal oder anderen nutzbaren Domanen eines Proteins erfolgen 
konnen. Solche Fusions vektor en dienen in der Regel dazu: i.) die 

40 Express ionsrate der RNA zu erhohen ii.) die erzielbare Protein- 
syntheserate zu erhohen, iii.) die Loslichkeit des Proteins zu 
erhohen, iv. ) oder die Reinigung durch einen fur die Affinitats- 
chromatographie nutzbare Bindeseguenz zu vereinf achen. Haufig 
werden auch proteolytische Spaltstellen iiber Fusionsproteine 

45 eingefuhrt, was die Abspaltung eines Teils des Fusionsproteins 
auch der Reinigung ermoglicht. Solche Erkennungssequenzen fur 
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Proteasen erkennen sind z.B. Faktor Xa, Thrombin und Entero- 
kinase . 

Typische vorteilhafte Fusions- und Express ions vektoren sind pGEX 
5 [Pharmacia Biotech Inc; Smith, D.B. and Johnson, K.S. (1988) Gene 
67: 31-40] , pMAL (New England Biolabs, Beverly, MA) and pRIT5 
(Pharmacia, Piscataway, NJ) welches Glutathion S- trans f erase 
beinhaltet (GST) , Maltose Bindeprotein, oder Protein A. 

10 Weitere Beispiele fur E. coli Express ions vektoren sind pTrc 

[Amann et al., (1988) Gene 69:301-315] und pET Vektoren [Studier 
et al., Gene Expression Technology: Methods in Enzymoiogy 185, 
Academic Press, San Diego, California (1990) 60-89; Stratagene, 
Amsterdam, Niederlande] . 

15 

Weitere vorteilhafte Vektoren zur Verwendung in Hefe sind 
pYepSecl (Baldari, et al . , (1987) Embo J. 6:229-234), pMFa 
(Kurjan and Herskowitz, (1982) Cell 30 : 933-943 ) , pJRY88 (Schultz 
et al., (1987) Gene 54 : 113-123 ) , and pYES-Derivate (Invitrogen 

20 Corporation, San Diego, CA) . Vektoren fur die Nutzung in fila- 

mentosen Pilzen sind beschrieben in: van den Hondel, C .A.M. J .J. & 
Punt, P.J. (1991) "Gene transfer systems and vector development 
for filamentous fungi, in: Applied Molecular Genetics of Fungi, 
J.F. Peberdy, et al . , eds . , p. 1-28, Cambridge University Press: 

25 Cambridge . 

Alternativ konnen auch vorteilhaft Insektenzellexpressionsvekto- 
ren genutzt werden z.B. fur die Expression in Sf 9 Zellen. Dies 
sind z.B. die Vektoren der pAc Serie (Smith et al. (1983) Mol . 
30 Cell Biol. 3:2156-2165) und der pVL series (Lucklow and Summers 
(1989) Virology 170:31-39) . 

Des weiteren konnen zur Genexpression vorteilhaft Pflanzen- 
zellen oder Algenzellen genutzt werden. Beispiele fur Pflanzen- 
35 express ions vektoren finden sich in Becker, D. , et al . (1992) 
"New plant binary vectors with selectable markers located 
proximal to the left border", Plant Mol. Biol. 20: 1195-1197 
oder in Bevan, M.W. (1984) "Binary A&robacterium vectors for 
plant transformation", Nucl. Acid. Res. 12: 8711-8721. 

40 

Weiterhin konnen die fur die A6-Desaturase codierenden Nuklein- 
sauresequenzen in Saugerzellen exprimiert werden. Beispiel 
fur entsprechende Expressionsvektoren sind pCDM8 und pMT2PC 
genannt in: Seed, B. (1987) Nature 329:840 oder Kaufman et al. 
45 (1987) EMBO J. 6: 187-195). Dabei sind vorzugsweise zu nutzende 
Promotoren viralen Ursprungs wie z.B. Promotoren des Polyoma, 
Adenovirus 2, Cytomegalovirus oder Simian Virus 40. Weitere 
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prokaryotische und eukaryotische Expressionssysteme sind genannt 
in Kapitel 16 und 17 in Sambrook et al., Molecular Cloning: 
A Laboratory Manual. 2nd, ed. , Cold Spring- Harbor Laboratory, 
Cold .Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 



Das Einbringen der erf indungsgema£en Nukleinsauren, der 
Expressionskassette oder des Vektors in Organismen beispiels- 
weise in Pflanzen kann prinzipiell nach alien dem Fachmann 
10 bekannten Methoden erfolgen. 

Fur Mikroorganismen kann der Fachmann entsprechende Methoden den 
Lebrbuchern von Sambrook, J. et al . (1989) Molecular cloning: 
A laboratory manual, Cold Spring Harbor Laboratory Press, von 

15 F.M. Ausubel et al . (1994) Current protocols in molecular bio- 
logy, John Wiley and Sons, von D.M. Glover et al., DNA Cloning 
Vol.1, (1995), IRL Press (ISBN 019-963476-9), von Kaiser et al. 
(1994) Methods in Yeast Genetics, Cold Spring Habor Laboratory 
Press oder Guthrie et al. Guide to Yeast Genetics and Molecular 

20 Biology, Methods in Enzymology, 1994, Academic Press entnehmen. 

Die Ubertragung von Fremdgenen in das Genom einer Pflanze wird 
als Transformation bezeichnet. Es werden dabei die beschriebenen 
Methoden zur Transformation und Regeneration von Pflanzen aus 

25 Pflanzengeweben oder Pf lanzenzellen zur transienten oder stabilen 
Transformation genutzt. Geeignete Methoden sind die Protoplasten- 
transformation durch Polyethylenglykol-induzierte DNA-Auf nahme , 
das biolistische Verfahren mit der Genkanone - die sogenannte 
particle bombardment Methode die Elektroporation, die Inku- 

30 bation trockener Embryonen in DNA-haltiger Losung, die Mikro- 
injektion und der durch Agrobacterium vermittelte Gentransfer. 
Die genannten Verfahren sind beispielsweise in B. Jenes et al. , 
Techniques for Gene Transfer, in: Transgenic Plants, Vol. 1, 
Engineering and Utilization, herausgegeben von S.D. Kung und R. 

35 Wu, Academic Press (1993) 128-143 sowie in Potrykus Annu. Rev. 

Plant Physiol. Plant Molec. Biol. 42 (1991) 205-225) beschrieben. 
Vorzugsweise wird das zu exprimierende Konstrukt in einen Vektor 
kloniert, der geeignet ist, Agrobacterium tumefaciens zu trans - 
formieren, beispielsweise pBinl9 (Bevan et al., Nucl. Acids Res. 

40 12 (1984) 8711). Mit einem solchen Vektor trans formierte Agro- 
bakterien konnen dann in bekannter Weise zur Transformation von 
Pflanzen, insbesondere von Kulturpf lanzen, wie z.B. von Tabak- 
pflanzen, verwendet werden, indem beispielsweise verwundete 
Blatter oder Blattstucke in einer Agrobakterienlosung gebadet 

45 und anschlieSend in geeigneten Medien kultiviert werden. Die 
Transformation von Pflanzen mit Agrobacterium tumefaciens wird 
beispielsweise von Hofgen und Willmitzer in Nucl. Acid Res. 



5 1989. 
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(1988) 16, 9 877 beschrieben oder ist unter anderem bekannt aus 
F.F. White, Vectors for Gene Transfer in Higher Plants; in Trans- 
genic Plants, Vol. 1, Engineering and Utilization, herausgegeben 
von S.D. Kung und R. Wu, Academic Press, 1993, S. 15-38. 

5 

Mit einem wie oben beschriebenen Expressionsvektor transf ormierte 
Agrobakterien konnen ebenfalls in bekannter Weise zur Trans- 
formation von Pflanzen wie Testpflanzen wie Arabidopsis oder 
Kulturpf lanzen wie Getreide, Mais, Hafer, Roggen, Gerste, Weizen, 

10 Soja, Reis, Baumwolle, Zuckerriibe, Canola, Triticale, Reis, 
Sonnenblume, Flachs , Hanf, Kartoffel, Tabak, Tomate, Kaffee, 
Kakao, Tee, Karotte, Paprika, Raps, Tapioka, Maniok, Pfeilwurz, 
Tagetes, Alfalfa, Salat und den verschiedenen Baum-, NuE- 
und Weinspezies, insbesondere von 01-haltigen Kulturpf lanzen, 

15 wie Soja, Erdnu£, Rizinus, Borretsch, Lein, Sonnenblume, 

Canola, Baumwolle, Flachs, Raps, Kokosnu£, Olpalme, Farbersaflor 
(Carthamus tinctorius) oder Kakaobohne verwendet werden, z.B. 
indem verwundete Blatter oder Blattstucke in einer Agrobakterien- 
losung gebadet und anschlie£end in geeigneten Medien kultiviert 

20 werden. 

Die genetisch veranderten Pf ianzenzellen konnen uber alle dem 
Fachmann bekannten Methoden regeneriert werden. Entsprechende 
Methoden konnen den oben genannten Schriften von S.D. Kung und 
25 R. Wu, Potrykus oder Hofgen und Willmitzer entnommen werden. 

Als Organismen bzw. Wirtsorganismen fur die erf indungsgema£en 
Verfahren verwendeten Nukleinsauren, die verwendete Expressions - 
kassette oder den verwendeten Vektor eignen sich prinzipiell 

30 vorteilhaft alle Organismen, die in der Lage sind Fettsauren 
speziell ungesattigte Fettsauren zu synthetisieren bzw. fur die 
Expression rekombinanter Gene geeignet sind. Beispielhaft seien 
Pflanzen wie Arabidopsis, Asteraceae wie Calendula oder Kultur- 
pf lanzen wie Soja, ErdnuE, Rizinus, Sonnenblume, Mais, Baum- 

35 wolle, Flachs, Raps, Kokosnu£, Olpalme, Farbersaflor (Carthamus 
tinctorius) oder Kakaobohne, Mikroorganismen wie Pilze bei- 
spielsweise die Gattung Mortierella, Saprolegnia oder Pythium, 
Bakterien wie die Gattung Escherichia, Cyanobakterien, Ciliaten, 
Thrausto- oder Schizichytrien, Algen oder Protozoen wie Dino- 

40 flagellaten wie Crypthecodinium genannt . Bevorzugt werden 
Organismen, die naturlicherweise Ole in groSeren Mengen 
synthetisieren konnen wie Pilze der Gattungen Mortierella oder 
Pythium wie Mortierella alpina, Pythium insidiosum oder Pflanzen 
wie Soja, Raps, KokosnuS, Olpalme, Farbersaflor, Rizinus, 

45 Calendula, Erdnufc, Kakaobohne oder Sonnenblume, besonders bevor- 
zugtr werden Soja, Raps, Sonnenblume, Rizinus, Mortierella oder 
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Pythium. Prinzipiell sind als Wirtsorganismen auch transgene 
Tiere geeignet beispielsweise C. elegans . 

Nutzbare Wirt sz ell en sind weiterhin genaxmt in: Goeddel, Gene 
5 Expression Technology: Methods in Enzymology 185, Academic Press, 
San Diego, CA (1990) . 

Verwendbare Express ionsstamme z.B. solche, die eine geringere 
Proteaseaktivitat aufweisen sind beschrieben in: Gottesman, S., 
10 Gene Expression Technology: Methods in Enzymology 185, Academic 
Press, San Diego, California (1990) 119-128. 

Dabei kann je nach Wahl des Promotors die Expression des 
A6-Desaturase-Gens spezifisch in den Blattern, in den Samen, den 

15 Knollen oder anderen Teilen der Pflanze erfolgen. Solche Fett- 

sauren, Ole oder Lipide mit A6-Doppelbindungen uberproduzierenden 
transgenen Pflanzen, deren Vermehrungsgut , sowie deren Pflanzen- 
zellen, -gewebe oder -teile, sind ein weiterer Gegenstand der 
vorliegenden Erfindung. Ein bevorzugter erf indungsgema£er Gegen- 

20 stand sind transgene Pflanzen beispielsweise Kulturpf lanzen wie 
Mais, Hafer, Roggen, Weizen, Gerste, Mais, Reis, Soja, Zucker- 
rube, Canola, Triticale, Sonnenblume, Flachs, Hanf, Tabak, 
Tomate, Kaffee, Kakao, Tee, Karotte, Paprika, Raps, Tapioka, 
Maniok, Pfeilwurz, Tagetes, Alfalfa, Salat und den verschiedenen 

25 Baum-, Nu£- und Weinspezies, Kartoffel, insbesondere 01-haltige 
Kulturpf lanzen, wie Soja, ErdnuS, Rizinus, Borretsch, Lein, 
Sonnenblume, Canola, Baumwolle, Flachs, Raps, KokosnuS, Olpalme, 
Farbersaflor (Carthamus tinctorius) oder Kakaobohne, Testpf lanzen 
wie Arabidopsis oder sonstige Pflanzen wie Moose oder Algen ent- 

30 haltend eine erf indungsgema£e funktionelle Nukleinsaureseguenz 
oder eine funktionelle Expressionskassette . Funktionell bedeutet 
hierbei, da£ ein enzymatisch aktives Enzym gebildet wird. 

Die Expressionskassette oder die erf indungsgema£en Nukleinsaure- 
35 sequenzen enthaltend eine A6-Desaturasegensequenz kann dariiber 
hinaus auch zur Transformation der oben beispielhaft genannten 
Organismen wie Bakterien, Cyanobakterien, filamentosen Pilzen, 
Ciliaten, Tiere oder Algen mit dem Ziel einer Erhohung des 
Gehaltes an Fettsauren, Olen oder Lipiden A6-Doppelbindungen ein- 
40 gesetzt werden. Bevorzugte transgene Organismen sind Bakterien, 
Cyanobakterien, filamentose Pilze oder Algen. 

Unter transgenen Organismen sind Organismen zu verstehen, die 
eine Fremde aus einem anderen Organismus stammende Nuklein- 
45 saure, die fur eine im erf indungsgema£en Verfahren verwendete 
A6-Desaturase codiert, enthalten. Unter transgenen Organismen 
sind auch Organismen zu verstehen, die eine Nukleinsaure , die 
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aus demselben Organismus stammt und die fur eine . A6-Desaturase 
codiert, enthS.lt/ wobei diese Nukleinsaure als zusatzliche 
Genkopie enthalten ist oder nicht in der natiirlichen Nuklein- 
saureumgebung des A6-Desaturase-Gens enthalten ist. Transgene 
5 Organismen sind auch Organismen bei denen die naturliche 3'- 
und/oder 5 ' -Region des A6-Desaturase-Gens durch gezielte gen- 
technologische Veranderungen gegeniiber dem Aus gangs or ganismus 
verandert wurde. Bevorzugt sind transgene Organismen bei denen 
eine Fremd-DNA eingebracht wurde. Besonders bevorzugt sind trans- 
it) gene Pflanzen, in die Fremd-DNA eingebracht wurde . Unter trans- 
genen Pflanzen sind einzelne Pf lanzenzellen und deren Kulturen 
wie beispielsweise Kalluskulturen auf Festmedien oder in Flussig- 
kultur, Pf lanzenteile und ganze Pflanzen zu verstehen. 

15 Ein weiterer Erf indungsgegenstand sind transgene Organismen 
ausgewahlt aus der Gruppe Pflanzen, Pilze, Ciliaten, Algen, 
Bakterien, Cyanobakterien oder Tiere, bevorzugt transgene 
Pflanzen oder Algen, die mindestens eine isolierte Nukleinsaure- 
sequenz enthalten, die fur ein Polypeptid mit A6-Desaturase- 

20 aktivitat codiert, ausgewahlt aus der Gruppe: 

a) einer Nukleinsaureseguenz mit der in SEQ ID NO: 1 dar- 
gestellten Sequenz, 

25 b) Nukleinsauresequenzen, die sich als Ergebnis des degenerier- 
ten genetischen Codes von der in SEQ ID NO: 1 ableiten 

c) Derivate der in SEQ ID NO: 1 dargestellten Nukleinsaure- 
sequenz, die fur Polypeptide mit der in SEQ ID NO: 2 
30 dargestellten Aminosauresequenzen codieren und mindestens 

50 % Homologie auf Aminosaureebene aufweisen, ohne da£ die 
enzymatische Wirkung der Polypeptide wesentlich reduziert 
ist. 

35 Erhohung des Gehaltes von Fettsauren, Olen oder Lipiden mit 

A6-Doppelbindungen bedeutet im Rahmen der vorliegenden Erfindung 
beispielsweise die kunstlich erworbene Fahigkeit einer erhohten 
Biosyntheseleistung durch funktionelle Uberexpression des 
A6-Desaturase-Gens in den erf indungsgemafien Organismen vorteil- 

40 haft in den erf indungsgemafcen transgenen Pflanzen gegeniiber den 
nicht gentechnisch modif izierten Ausgangspf lanzen zumindest fur 
die Dauer mindestens einer Pf lanzengeneration. 

Der Biosyntheseort von Fettsauren, Olen oder Lipiden beispiels- 
45 weise ist im allgemeinen der Samen oder Zellschichten des Samens , 
so da£ eine samenspezif ische Expression des A6-Desaturase-Gens 
sinnvoll ist. Es ist jedoch naheliegend, da£ die Biosynthese 
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von Fettsauren, Olen oder Lipiden nicht auf das Samengewebe 
beschrankt sein mu£, sondern auch in alien ubrigen Teilen der 
Pflanze - beispielsweise in Epidermiszellen oder in den Knollen 
-gewebe spezifisch erfolgen kann. 

5 

Daruberhinaus ist eine konstitutive Expression des exogenen 
A6-Desaturase-Gens von Vorteil. Andererseits kann aber auch 
eine induzierbare Expression wiinschenswert erscheinen. 

10 Die Wirksamkeit der Expression des A6-Desaturase-Gens kann 

beispielsweise in vitro durch Spro&neristemvermehrung ermittelt 
werden. Zudem kann eine in Art und Hohe veranderte Expression 
des A6-Desaturase-Gens und deren Auswirkung auf die Fettsaure-, 
01- oder Lipidbiosyntheseleistung an Testpflanzen in Gewachshaus- 

15 versuchen getestet werden. 

Gegenstand der Erfindung sind wie oben beschrieben transgene 
Pflanzen, trans fonniert mit einer Nukleinsauresequenz , die fur 
eine A6-Desaturase codiert, einem Vektor oder einer Expressions* 
20 kassette enthaltend eine A6-Desaturase-Genseguenz oder mit dieser 
hybridisierende DNA-Seqrienzen, sowie transgene Zellen, Gewebe, 
Teile und Vermehrungsgut solcher Pflanzen. Besonders bevorzugt 
sind dabei transgene Kulturpf lanzen wie oben beschrieben. 

25 Pflanzen im Sinne der Erfindung sind mono- und dikotyle Pflanzen 
oder Algen. 

Weitere Gegenstande der Erfindung sind: 

30 - Verwendung einer A6-Desaturase-DNA-Gensequenz mit der in 

SEQ ID N0:1 genannten Sequenz oder mit dieser hybridisierende 
DNA-Sequenzen zur Herstellung von Pilzen, Bakterien, Tieren 
oder Pflanzen bevorzugt Pflanzen mit erhohtem Gehalt an 
Fettsauren, Olen oder Lipiden mit A6-Doppelbindungen durch 

35 Expression dieser A6-Desaturase DNA-Sequenz in Pflanzen. 

- Verwendung der Proteine mit den Sequenzen SEQ ID NO: 2 zur 

Herstellung von ungesattigten Fettsauren in Pflanzen, Pilzen, 
Bakterien oder Tieren bevorzugt Pflanzen. 



45 



WO 01/02591 




PCT/EP00/06223 



Die Erfindung wird durch die folgenden Beispiele naher erlautert : * 
Beispiele 

5 Beispiel 1: Allgemeine Klonierungsverf ahren und Anzuchts- 
verf ahren: 

Die Klonierungsverfahren wie z.B. Restriktionsspaltungen, 
Agarose-Gelelektrophorese, Reinigung von DNA-Fragmenten, Transfer 

10 von Nukleinsauren auf Nitrozellulose und Nylon Membranen, Ver- 
kmipfen von DNA-Fragmenten, Transformation von Escherichia coli 
Zellen, Anzucht von Organismen und die Sequenzanalyse rekombinan- 
ter DNA wurden wie bei Sambrook et al . (1989) (Cold Spring Harbor 
Laboratory Press: ISBN 0-87969-3 09-6) beschrieben durchgef uhrt . 

15 Das Protonema von Physcomitrella patens (= P. patens) wurde in 
Fliissigmedium, wie von Reski et al . (Mol . Gen. Genet., 244, 1994: 
352-359) beschrieben, angezogen. 

Beispiel 2 : Sequenzanalyse rekombinanter DNA 

20 

Die Sequenzierung rekombinanter DNA-Molekiile erfolgte mit einem 
Laserfluoreszenz-DNA-Sequenzierer der Firma ABI nach der Methode 
von Sanger (Sanger et al . (1977) Proc . Natl. Acad. Sci. USA7 4 , 
5463-5467) . Fragmente resultierend aus einer Polymerase Ketten- 
25 reaktion wurden zur Vermeidung von Polymerasef ehlern in zu 
exprimierenden Konstrukten sequenziert und uberpruft. 

Beispiel 3: Lipidanalyse aus dem Protonema von P. patens und 
aus Hefezellen 

30 

Die Lipide wurden mit Chlorof orm/Methanol wie bei Siebertz 

et al. (Eur. J. Biochem. , 101, 1979: 429-438) beschrieben 

aus dem Protonema von S. patens oder aus Hefezellen extrahiert 

und iiber Dunnschichtchromatographie (= TLC) mit Diethylether ge- 

35 reinigt. Die erhaltenen Fettsauren wurden zu den entsprechenden 
Methylestern transmethyliert und mit Gaschromatographie (= GO 
analysiert. Die verschiedenen Methylester wurden mit den ent- 
sprechenden Standards identif iziert . Entsprechende Fettsaure- 
pyrrolididen wurden, wie bei Anderson et al . (Lipids, 9, 1974: 

40 185-190) beschrieben, erhalten und mit GC-MS bestimmt . 



45 
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Beispiel 4: Funktionelle Expression der A6-Desaturase cDNA von 
P. patens in Hefen 



Die Expression-Experimente in Hefen wurden mit PPDES6-cDNA durch- 
5 gefiihrt. Knock- out -Exprimente hatten gezeigt (Daten und Versuchs- 
durchfuhrung nicht gezeigt bzw. beschrieben) , da£ der Knock-out 
zu einem Verlust an 20 : 3 11 > 14 > 17 - , 20:4 s ' 8 ' 11 ' 14 -, 20 : 4 5 ' 11 < 14 - 1 7- und 
20:5 5 ' 8 ' 11 ' 14 - 17 -Fettsauren f iihrt . Gleichzeitig steigen die 18:2 9 ' 12 - 
und 18 : 3 9 ' 12 ' 15 -Fettsauren an. Fur-die Expression in Hefe wurde 

10 der PPDES 6 - c DNA in den Hef e-Expressionsvektor pYES2 (Invitrogen) 
subkloniert. Der erhaltene Vektor erhielt die Bezeichnung pYES- 
delta6. Mit pYES2 (Kontroile) und pYESdelta6 (A6-Desaturase-cDNA) 
trans formierte Hefekulturen wurden auf Uracil-dop-out Medium mit 
2 % Raffinose und 1 % Tergitol NP-4 0 (zur Stabilisierung der 

15 Fettsauren) angezogen. Fur die Expression wurden die Zellen mit 
Galactose (Endkonzentration 2 %) bis zu einer optischen Dichte 
(= OD) von 0,5 bei 600nm angezogen. In Fiitterungsexperimenten 
wurden Fettsauren in 5 % Tergitol solubilisiert und mit einer 
Endkonzentration von 0,0003 % zugesetzt. Die Ergebnisse der 

20 Expression sind Tabelle I zu entnehmen. Die Synthese von Fett- 
sauren mit einer Doppelbindung an Position 6 ist nur in Gegenwart 
des Expressionskonstrukts mit der A6-Desaturase-cDNA moglich. 
Dieses A6-Desaturase-Enzym hat eine groSere Aktivitat gegenuber 
Fettsauren, die schon eine Doppelbindung an Position 9 oder 12 

25 (Bezug auf Kohlenstof f atom in der Kette) enthalten. Es wurden 
die Fettsauremethylester des gesamten Lipids der Hefen mit GC 
analysiert. Die einzelnen synthetisierten Fettsauren werden in 
der Tabelle in Mol-% der gesamten Fettsauren angegeben. 



30 Tabelle I: Fettsaurezusammensetzung in trans formier ten Hefen 
gegemiber der Kontroile 



35 



40 



Gesamt Fettsaure (%) 




pYES2 




pYESdelta6 




Fettsauren 






+ 18:29.12 


+ 18:39.12,15 


16:0 


16,4 


16, 1 


23, 8 


25,8 


16:19 


54,0 


55, 5 


38, 1 


31,4 


16:26-9 




4,2 


1,7 




18:0 


3,2 


2,4 


4, 0 




18:19 


24,9 


19,7 


19, 1 


19,2 


18:26.9 




0,6 


0,2 




18:29.12 






8,5 




18:3 6 '9,12 






4,0 




18:39.12,15 








11, 7 


18 :46,9,12,15 








3,0 
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Beispiel 5: Transformation von P. patens 

Die Polyethylenglycol vermittelte direkte DNA- Trans format ion von 
Protoplasten wurde, wie von Schafer et al . (Mol. Gen. Genet., 
5 226, 1991: 418-424) beschrieben, durchgef tlhrt . Die Selektion 
der Trans forman ten erfolgte auf G418-enthaltenden Medium (Girke 
et al., The Plant Journal, 15, 1998: 3 9-48). 

Beispiel 6: Isolierung von A6-Desaturase cDNA und genomischen 
10 Clonen von P. patens 

Mit Hilfe eines PCR-Ansatzes mit den folgenden degenerierten 
Oligonukleotiden als Primer: 

15 A: TGGTGGAA(A/G)TGGA(C/A) ICA(T/C)AA und 

B: GG (A/G) AA (A/C/G/T) A (A/G) ( G/A) TG (G/A) TG (C/T) TC] 

und dem folgenden Temper aturprogramm : 

94°C, 3 min; [94°C, 20 sec; 45°C, 30 sec; 72°C, 1 min] , 30 Zyklen; 

20 72°C, 5 min, wurden schlieSlich Fragmente einer A6-Desaturase-Gen 
kloniert. Fur die Kldnierung wurde poly(A)RNA aus 12 Tage alten 
P. patens Protonema-Kultur isoliert. Mit dieser poly(A)RNA wurde 
die oben beschriebene PCR durchgef lihrt . Fragmente der erwarteten 
Fragment lange (500 bis 600 bp) wurden in pUC18 kloniert und 

25 sequenziert. Die abgeleitete Aminosauresequenz eines PCR- Frag- 
ments zeigte Ahnlichkeiten zu bekannten A6-Desaturasen. Da bekannt 
war, da£ P. patens eine A6-Desaturase besitzt, wurde angenommen, 
da£ dieser Klon fur einen Teil einer A6-Desaturase codiert. 

30 Ein vollstandiger cDNA-Klon (= PPDES 6 - cDNA ) wurde aus einer 

p. patens cDNA-Bank von 12 Tage alten Protonemata mit Hilfe des 
oben genannten PCR-Fragments isoliert. Die Nukleotidsequenz wird 
in SEQ ID NO:l wiedergegeben . Die abgeleitete Aminosauresequenz 
ist SEQ ID NO: 2 zu entnehmen. Die zugehorige genomische Sequenz 

35 (= PPDES6-Gen) konnte mit Hilfe der PCR und den folgenden Oligo- 
nukleotiden als Primer isoliert werden: 

C: CCGAGTCGCGGATCAGCC 

D : C AGTAC ATTC GGTC ATTC AC C : 

40 

Tabelle II gibt die Ergebnisse des Vergleichs zwischen der neuen 
P. patens A6-Desat"urase liber die gesamte Nukleinsauresequenz mit 
folgenden bekannten A6-Desaturase wieder: Borago officinalis 

(U79010), Synechocystis sp (L11421), Spirulina platensis 
45 (X87094), Caenorhabiditis elegans (AF031477), Mortierella alpina 

(WO 98/46764), Homo sapiens (Cho et al., J. Biol. Chem. , 274, 
1999: 471-477), Rattus . norvegicus (AB021980) und Mus mus cuius 
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(Cho et aL, J. Biol. Chem. , 274, 1999: 471-477). Die Analyse 
wurde mit dem Gap Programm (GCG-Package, Version 9,1) und den 
folgenden Analysenparametern durchgefiihrt : scoring matrix, 
blosum62, gap creation penalty, 12; gap extension penalty, 4.. 
5 Die Ergebnisse geben die bestimmte Identitat oder Ahnlichkeit [] 
in Prozent (%) iin Vergleich zur P. patens -Sequenz wieder. 

Tabelle II: Sequenzvergleich zwischen P. patens A6-Desaturase und 
anderen A6-Desaturasen 

10 



15 



Sequenz 


Aminosaur eseguenz - Identitat 
[Ahnlichkeit] (%) 


Borago officinalis 


31 [38] 


Synechocystis sp. 


21 [29] 


Spirulina platensis 


20 [29] 


Caenorhabditis elegans 


35 [43] 


Mortierella alpina 


39 [47] 


Homo sapiens 


27 [38] 


Rattus norvegicus 


28 [39] 


Mus mus cuius 


29 [39] 



Beispiel 7: Klonierung der A6-Desaturase aus Physcomitrella 
25 patens 

Die genomische A6-Acyllipid-Desaturase aus Physcomitrella patens 
wurde auf Grundlage der verof f entlichten Sequenz (Girke et al. # 
Plant J., 15, 1998: 39-48) mittels Polymerasekettenreaktion und 
30 Klonierung modifiziert, isoliert und fur das erf indungsgema£e 
Verfahren eingesetzt. Dazu wurde zunachst mittels Polymerase- 
kettenreaktion unter Verwendung von zwei " genspezif ischen Primern 
ein Desaturase-Fragment isoliert und in das bei Girke et al . 
(siehe oben) beschriebene Desaturasegen eingesetzt. 

35 

Primer TG5 : 5'- ccgctcgagcgaggttgttgtggagcggc und 
Primer TG3 : 5 % -ctgaaatagtcttgctcc-3 ' 

dienten zunachst zur Amplif izierung eines Genf ragmentes mittels 
40 Polymerasekettenreaktion (30 Zyklen, 30 sek. 94° V, 30 sek. 50°C, 
60 sek. 72°C / 10 min Nachinkubation bei 72°C / in einem Perkin 
Elmer Thermocycler ) . 



45 
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a) Rlonierung eines Express ionsplasmids , das die A6-Desaturase - 
unter Kontrolle des 35S CaMV Promotors exprimiert : 

Durch Primer TG5 wurde eine Xhol Schnittstelle in das 
5 Fragment eingefuhrt. Ein Xhol /Eco47 III Fragment wurde durch 

Restriktion erhalten und in die bei Girke et al . beschriebene 
PPDES6-Genseguenz nach analoger Restriktion mit XhoI/Eco47III 
ausgetauscht . Das Ronstrukt erhielt den Namen pZR. Das Insert 
von pZR wurde als Xhol/Hindlll Fragment nach Auffullen der 

10 Hindlll-Schnittstelle mit Nukleotiden durch Behandlung mit 

dem Rlenow Fragment der DNA Polymerase I in die Xhol/Smal 
Schnittstellen von pRT99/35S kloniert. Das resultierende 
Plasmid pSK enthalt den 35S-Promotor [Caulif lower-Mosaik-Vi- 
rus, Franck et al . (1980) Cell 21, 285], die A6-Desaturase 

15 aus Moos und den 35S-Terminator im Vektor pRT. 

b) Ronstruktion eines Expressionskonstruktes unter Rontrolle 
des Napin-Promotors: 

20 Durch Schneiden des Plasmides pSR mit Xhol, Behandlung mit 

T4-DNA Polymerase und Pstl-Restriktion wurde das erhaltene 
Promotor-Desaturase-Fragment mit Terminator in den Vektor 
pJH3 kloniert. Dazu wurde der Vektor BamHI geschnitten und 
mit Rlenow-Enzym die Oberhange aufgefullt sowie anschlie£end 

25 mit PstI nachgeschnitten. Es entstand durch Ligation des 

Desaturase-Terminator-Fragmentes in den Vektor das Plasmid 
pJH7, das einen Napin-Promotor beinhaltet (Scofield et al . , 
1987, J. Biol. Chem. 262, 12202-8). Die Expressionskassette 
aus pJH7 wurde mit Bspl20I und Not I geschnitten und in den 

30 binaren Vektor pRE kloniert. -Es entstand das Plasmid pRE- 

Ppdes6. 

In einer PCR Reaktion wurde die erf indungsgemaSe 
A6-Desaturase cDNA aus P. patens als Matrize verwendet. 

35 Mithilfe der nachfolgend aufgefuhrten Oligonukleotide wurde 

eine BamHI-Restriktionsschnittstelle vor dem Startcodon und 
drei Adeninnukleotide als Konsensustranslationsseouenz fur 
Eukaryoten in die A6-Desaturase cDNA eingefuhrt. Es wurde 
ein 1512 Basenpaarf ragment der A6-Desaturase amplifiziert 

40 und sequenziert. 

Pp-d6Desl: 5 ' - CC GGTACC aaaatggtattcgcgggcggtg -3' 
Pp-d6Des2: 3'- CC GGTACC ttaactggtggtagcatgct -3' 



45 



Die Reaktionsgemische enthielten ca. 1 ng/micro 1 Matrizen 
DNA, 0,5 MM der Oligonukleotide und, 200 \M Desoxy-Nukleotide 
(Pharmacia), 50 mM. KC1, 10 mM Tris-HCl (pH 8,3 bei 25°C, 
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1*5 mM MgCl 2 ) und 0,02 U/}Xl Pwo Polymerase (Boehringer Mann- 
heim) und werden in einer PCR-Maschine der Firma Perkin Elmer 
mit folgendem Temperaturprogramm inkubiert: 

5 Anl agerungs temper a tur : 50°C, 3 0 sec 

Denaturierungstemperatur : 95°C, 30 sec 

Elongations temperatur: 72°C, 90 sec 

Anzahl der Zyklen: 3 0 

10 c) Konstruktion eines Expressionskonstruktes -unter Kontrolle des 
USP-Promotors : 

Das erhaltene Fragment von ca. 1,5 kB Basenpaaren wurde in 
den mit EcoRV gespaltenen Vektor pBluescript SK- (Stratagene) 
ligiert und stand fur weitere Klonierungen als BamHI Fragment 
15 . zur Verfugung. 

Fur die Transformation von Pflanzen wurde ein weiterer Trans- 
format ions vektor auf Basis von pBin-USP erzeugt, der das 
BamHI -Fragment der A6-Desaturase enthalt . pBin-USP ist ein 

20 Deri vat des Plasmides pBin!9 . pBinUSP entstand aus pBinl9, 

indem in pBinl9 [Bevan et al . (1980) Nucl. Acids Res. 12, 
8711] ein USP-Promotor als EcoRI-BaMHI-Fragnient inseriert 
wurde. Das Polyadenylierungssignal ist das des Gens 3 der 
T-DNA des Ti-Plasmides pTiACHS (Gielen et al., (1984) EMBO 

25 J. 3, 835), wobei Nukleotide 11749-11939 als PvuII-Hindlll- 

Fragment isoliert und nach Addition von Sphl-Linkera an die 
PvuII-Schnittstelle zwischen die SpHI-Hindlll Schnittstelle 
des Vektors kloniert. Der USP-Promotor entspricht den Nukleo- 
tiden 1-684 (Genbank Accession X56240) , wobei ein Teil der 

30 nichtcodierenden Region des USP-Gens im Promotor enthalten 

ist. Das 684 Basenpaar gro&e Promotorf ragment wurde mittels 
kauflichen T7-Standardprimer (Stratagene) und mit Hilfe eines 
synthetisierten Primers uber eine PCR-Reaktion nach Standard- 
methoden amplifiziert (Primersequenz : 5 ' -GTCGACCCGCGGACTAGTG- 

35 GGCCCTCTAGACCCGGGGGATCC GGATCTGCTGGCTATGAA-3 ' ) . Das PCR- 

Fragment wurde mit EcoRI/Sall nachgeschnitten und in den 
Vektor pBin!9 mit OCS Terminator eingesetzt. Es entstand 
das Plasmid mit der Bezeichnung pBinUSP. 

40 d) Konstruktion eines Expressionskonstruktes unter Kontrolle 
des vATPase-Cl-Promotors aus Beta vulgaris: 

Analog zum Expressionsplasmid mit dem USP-Promotor wurde 
ein Konstrukt unter Verwendung des v-ATPase-cl-Promotors 
45 erstellt. Der Promotor wurde als EcoRI/Kpnl Fragment in das 

Plasmid pBinl9 mit OCS Terminator kloniert und liber BamHI 
das A6-Desaturasegen aus P. patens zwischen Promotor und 
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Terminator inseriert . Der Promotor entspricht einem 1153 
Basenpaarfragment aus beta-Vulgaris (Plant Mol Biol, 1999 , 
39:463-475) . 



5 



Das Konstrukt wurde zur Transformation von Arabidopsis 



thaliana und Rapspflanzen eingesetzt. 
Beispiel 8 : Erzeugung transgener Rapspflanzen (verandert nach 



Zur Erzeugung transgener Rapspflanzen wurden binare Vektoren in 
Agrobacterium tumefaciens C58C1 :pGV2260 oder Escherichia coli 
genutzt (Deblaere et al , 1984, Nucl. Acids. Res. 13, 4777-4788). 
Zur Transformation von Rapspflanzen (Var. Drakkar, NPZ Nord- 

15 deutsche Pf lanzenzucht , Hohenlieth, Deutschland) , wurde eine 1:50 
Verdiinnung einer Ubernachtkuitur einer positiv transf ormierten 
Agrobakterienkolonie in Murashige-Skoog Medium (Murashige und 
Skoog 1962 Physiol. Plant. 15, 473) mit 3 % Saccharose ( 3MS- 
Medium) benutzt . Petiolen oder Hypokotyledonen frisch gekeimter 

20 steriler Rapspflanzen (zu je ca. 1 cm 2 ) wurden in einer Petri - 
schale mit einer 1:50 Agrobakterienverdiinnung fur 5-10 Minuten 
inkubiert. Es folgte eine 3-tagige Inkubation in Dunkelheit bei 
25°C auf 33yiS-Medium mit 0,8 % Bacto-Agar. Die Kultivierung wurde 
nach 3 Tagen mit 16 Stunden Licht/8 Stunden Dunkelheit weiter- 

25 gefuhrt und in wochentlichem Rhythmus auf MS-Medium mit 500 mg/1 
Claforan (Cef otaxime-Natrium) , 50 mg/1 Kanamycin, 20 ]iM Benzyl - 
aminopurin (BAP) und 1,6 g/1 Glukose weitergef uhrt . Wachsende 
Sprosse wurden auf MS-Medium mit 2 % Saccharose, 250 mg/1 
Claforan und 0,8 % Bacto-Agar uberfuhrt. Bildeten sich nach 

30 drei Wochen keine Wurzeln, so wurde als Wachstumshormon 2-Indol- 
buttersaure zum Bewurzeln zum Medium zugegeben. 

Regenerierte Sprosse wurden auf 2MS-Medium mit Kanamycin 

und Claforan erhalten, nach Bewurzelung in Erde uberfuhrt und 

35 nach Kultivierung fur zwei Wochen in einer Klimakammer oder im 
Gewachshaus angezogen, zur Blute gebracht, reife Samen geerntet 
und auf A6-Desaturase-Expression mittels Lipidanalysen unter- 
sucht. Linien mit erhohten Gehalten an oder Doppelbindungen an 
der A6-Position wUrden identif iziert . Es konnte in den stabil 

40 transf ormierten transgenen Linien, die das Transgen funktionell 
exprimierten, ein erhohter Gehalt von Doppelbindungen an 
der A6-Position im Vergleich zu untransf ormierten Kontroll- 
pflanzen feststellt werden. 



Moloney et al . , 1992, Plant Ceil Reports, 8:238-242) 
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Beispiel 8 : Lipidextraktion aus Samen 

Das Pf lanzenmaterial wurde zunachst mechanisch durch Morsern 
homogenisiert , um es einer Extraktion zuganglicher zu machen. 

5 

Dann wurde es 10 min bei 100°C abgekocht und nach dem Abkuhlen auf 
Eis sedimentiert . Das Zellsediment wurde mit 1 N methanol ischer 
Schwef elsaure und 2 % Dimethoxypropan lh bei 90 °C hydrolysiert 
und die Lipide transirtethyliert . Die resultierenden Fettsaure- 

10 methylester {FAME ) wurden in Petrolether extrahiert. Die extra- 
hierten FAME wurden durch Gasf liissigkeitschromatographie mit 
einer Kapillarsaule (Chrompack, WCOT Fused Silica, CP-Wax-52 CB, 
25 m, 0,32 mm) und einem Temperaturgradienten von 170°C auf 240°C 
in 20 min und 5 min bei 240°C analysiert. Die Identitat der Fett- 

15 sauremethylester wurde durch Vergleich mit entsprechenden FAME- 
Standards (Sigma) bestatigt. Die Identitat und die Position der 
Doppelbindung konnte durch geeignete chemische Derivatisierung 
der FAME-Gemische z.B. zu 4 , 4-Dimethoxyoxazolin-Derivaten 
(Christie, 1997, in: Advances in Lipid Methodology, 4. Auflage: 

20 Christie, Oily Press, Dundee, 119-169, und 1998, Gaschromato- 

graphie-Massenspektrometrie Verf ahren, Lipide 33:343-353) mittels 
GC-MS weiter analysiert werden. Die GC-Analysen der Fettsaure- 
methylester aus den transgenen Rapssamen, die samenspezif isch die 
A6-Desaturase exprimierten sind in Tabelle III dargestellt. Die 

25 transgenen Rapssamen weisen mindestens 4,95 % y-Linolensaure im 
Samen auf . 

Tabelle III gibt die GC-Analysen der Fettsauremethylester 
aus reifen, transgenen Rapssamen, die A6-Desaturase samen- 
30 spezifisch exprimieren, wieder. Die Fettsaurezusammensetzung ist 
in [mol %] der Gesamtf ettsauren angegeben. Es ist f estzustellen, 
da£ einzelne Pflanzen der T2 Generation, die aus positiv trans- 
formierten und geselbsteten Pflanzen erhalten wurden, bis zu ca. 
4,95 % Y-Linolensaure enthalten. 

35 
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Tabelle III: GC-Analysen der Fettsauremethylester von Raps 
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24 , 54 


3, 88 


10,12 


0 


R2-T2-11/4 


1,72 


48, 02 


24, 66 


4, 95 


9,52 


0 


R2-T2-ll/5a 


1,73 


51,98 


25,27 


4,27 


9, 61 


0 


R2-T2-ll/5b 


2, 02 


56,19 


25, 08 


0 


9,33 


0 


R2-T2-ll/5c 


2, 01 


46, 95 


27, 38 


0 


10,37 


0 


R2-T2-ll/5d 


1,83 


49,49 


24,15 


4, 40 


8, 65 


0 


R2-T2-11/6 


2, 08 


54,52 


23,94 


2, 05 


9,29 


0 


R2-T2-11/10 


1, 94 


53,92 


22, 81 


4, 06 


9,44 


0 


R2-T2-WT 


1, 90 


47,75 


30,91 


0 


10, 51 


0 
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Patentanspruche 

1. Verfahren zur Herstellung von ungesattigten Fettsauren, da- 
5 durch gekermzeichnet , da£ mindestens eine isolierte Nuklein- 

sauresequenz, die fur ein Polypeptid mit A6-Desaturase- 
aktivitat codiert, ausgewahlt aus der Gruppe: 

a) einer Nukleinsauresequenz iait der in SEQ ID NO: 1 dar- 
10 gestellten Seguenz , 

b) Nukleinsauresequenz en, die sich als Ergebnis des 
degenerierten genetischen Codes von der in SEQ ID NO: 1 
ableiten 

15 

c) Derivate der in SEQ ID NO: 1 dargestellten Nukleinsaure- 
sequenz, die fur Polypeptide mit der in SEQ ID NO: 2 dar- 
gestellten Amino s aur es eguenz en codieren und mindestens 

50 % Homologie auf Aminosaureebene aufweisen, ohne da£ 
20 die enzymatische Wirkung der Polypeptide wesentlich 

reduziert ist, 

in einen Organismus eingebracht wird, dieser Organismus 
angezogen wird, wobei der angezogene Organismus mindestens 
25 1 Mol-% ungesattigte Fettsauren bezogen auf den gesamten 

Fettsauregehalt im Organismus enthalt . 

2. Verfahren nach Anspruch 1, dadurch gekennzeichnet , da£ die 
Nukleinsauresequenz von einer Pflanze oder Alge stairant. 

30 

3. Verfahren nach Anspruch 1 oder 2, dadurch gekennzeichnet, 
da£ die Nukleinsauresequenz von Physcomitrella patens stammt. 

4. Verfahren nach den Anspruchen 1 bis 3, dadurch gekerrn- 

35 zeichnet, da£ es sich bei dem Organismus urn ein organismus 

ausgewahlt aus der Gruppe Bakterium, Pilz, Ciliat, Alge, 
Cyanobakterium, Tier oder Pflanze handelt. 

5. Verfahren nach den Anspruchen 1 bis 4, dadurch gekenn- 
40 zeichnet, da£ es sich bei dem Organismus um eine Pflanze 

oder Alge handelt. 
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5 7. 



10 8. 



9. 

15 



20 



25 



30 



10. 

35 



Verf ahren nach den Anspriichen 1 bis 5 , dadurch gekenn- 
zeichnet, da£ es sich bei dem Organismus um eine Olfrucht- 



Verf ahren nach den Anspriichen 1 bis 6, dadurch gekenn- 
zeichnet, daS der angezogene Organismus mindestens 5 Gew-% 
ungesattigte Fettsauren bezogen auf den gesamten Fettsaure- 
gehalt im Organismus enthalt . 

Verf ahren nach den Anspriichen 1 bis 7, dadurch gekenn- 
zeichnet, da£ die ungesattigten Fettsauren aus dem Organismus 
isoliert werden. 

Transgener Organismus ausgew^hlt aus der Gruppe Pflanzen, 
Pilze, Ciliaten, Algen, Bakterien, Cyanobakterien oder Tiere, 
die mindestens eine isolierte Nukleinsauresequenz enthalten, 
die fur ein Polypeptid mit A6-Desaturaseaktivitat codiert, 
ausgewahlt aus der Gruppe: 

a) einer Nukleinsauresequenz mit der in SEQ ID NO: 1 
dargestellten Seguenz , 

b) Nukleinsauresequenzen, die sich als Ergebnis des 
degenerierten genetischen Codes von der in SEQ ID NO: 1 
ableiten 

c) Derivate der in SEQ ID NO: 1 dargestellten Nukleinsaure- 
sequenz, die fur Polypeptide mit der in SEQ ID NO: 2 dar- 
gestellten Amino sauresequenz en codieren und mindestens 

50 % Homologie auf Aminos aureebene aufweisen, ohne da£ 
die enzymatische Wirkung der Polypeptide wesentlich 
reduziert ist. 

Transgener Organismus nach Anspruch 9 , dadurch gekenn- 
zeichnet, da£ es sich bei dem Organismus um eine Pflanze 
oder Alge handelt. 

01, Lipide oder Fettsauren oder eine Fraktion davon, her- 
gestellt durch das Verfahren nach einem der Anspruche 1 
bis 8. 

Verwendung der 01-, Lipid- oder Fettsaurezusammensetzung nach 
Anspruch 11 oder transgene Organismen nach Anspruch 9 in 
Futtermitteln, Nahrungsmitteln, Kosmetika oder Pharmazeutika . 



pflanzen handelt. 
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SEQUENZ PROTOKOLL 



<17 0> Patent In Vers. 2.0 

<210> 1 
<211> 2012 
<212> DNA 

<213> Physcomitrella patens 

<220> 
<221> CDS 

<222> (319) . . (1896) 
<400> 1 

ccgagtcgcg gatcagccat cgcccgccca gggccgcctg cattgtgtgg gacggtgttg 60 

gaggaggagg cagatgcgcg ggcgttggtg gagtcgtcat ccgaggatct actgcggcaa 120 

tacctccggg ttttggagcg ggcaaactct gttgcggctc ggaaggctat aggttcggca 180 

ggagactgtt gattttatgt cgggggcatt gccattgtgg agagcggggg agactcagga 240 

tctgtgagtg tgcgtgcagc gccccgactg ccgcagagcg tctgtgtatg acgaggttgt 3 00 

tgtggagcgg cttttgaa atg gta ttc gcg ggc ggt gga ctt cag cag ggc 351 

Met Val Phe Ala Gly Gly Gly Leu Gin Gin Gly 

.1 5 10 

tct etc gaa gaa aac ate gac gtc gag cac att gec agt atg tct etc 399 
Ser Leu Glu. Glu Asn lie Asp Val Glu His lie Ala Ser Met Ser Leu 
15 20 25 

ttc age gac ttc ttc agt tat gtg tct tea act gtt ggt teg tgg age 447 
Phe Ser Asp Phe Phe Ser Tyr Val Ser Ser Thr Val Gly Ser Trp Ser 
30 35 40 

gta cac agt ata caa cct ttg aag cgc ctg acg agt aag aag cgt gtt 495 
Val His Ser lie Gin Pro Leu Lys Arg Leu Thr Ser Lys Lys Arg Val 
45 50 55 

teg gaa age get gee gtg caa tgt ata tea get gaa gtt cag aga aat 543 
Ser Glu Ser Ala Ala Val Gin Cys lie Ser Ala Glu Val Gin Arg Asn 
60 65 70 75 

teg agt acc cag gga act gcg gag gca etc gca gaa tea gtc gtg aag 591 
Ser Ser Thr Gin Gly Thr Ala Glu Ala Leu Ala Glu Ser Val Val Lys 

80 85 90 



ccc acg aga cga agg tea tct cag tgg aag aag teg aca cac ccc eta 
Pro Thr Arg Arg Arg Ser Ser Gin Trp Lys Lys Ser Thr His Pro Leu 
95 100 105 



639 



WO 01/02591 PCT/EP00/O6223 



tea gaa gta gca gta cac aac aag cca age gat tgc tgg att gtt gta 687 
Ser Glu Val" Ala Val His Asn Lys Pro Ser Asp Cys Trp lie Val Val 
110 115 120 



aaa aac aag gtg tat gat gtt tec aat ttt gcg gac gag cat ccc gga 735 
Lys Asn Lys Val Tyr Asp Val Ser Asn Phe Ala Asp Glu His Pro Gly 
125 130 135 



gga tea gtt att agt act tat ttt gga cga gac ggc aca gat gtt ttc 783 

Gly Ser Val lie Ser Thr Tyr Phe Gly Arg Asp Gly Thr Asp Val Phe 

140 145 150 155 

tct agt ttt cat gca get tct aca tgg aaa att ctt caa gac ttt tac 831 

Ser Ser Phe His Ala Ala Ser Thr Trp Lys He Leu Gin Asp Phe Tyr 

160 165 170 



att ggt gac gtg gag agg gtg gag ccg act cca gag ctg ctg aaa gat 879 
He Gly Asp Val Glu Arg Val Glu Pro Thr Pro Glu Leu Leu Lys Asp 
175 180 185 



ttc cga gaa atg aga get ctt ttc ctg agg gag caa ctt ttc aaa agt 927 
Phe Arg Glu Met Arg Ala Leu Phe Leu Arg Glu Gin Leu Phe Lys Ser 
190 195 200 



teg aaa ttg tac tat gtt atg aag ctg etc acg aat gtt get att ttt 975 

Ser Lys Leu Tyr Tyr Val Met Lys Leu Leu Thr Asn Val Ala He Phe 
205 ' 210 215 

get gcg age att gca ata ata tgt tgg age aag act att tea gcg gtt 1023 

Ala Ala Ser He Ala He He Cys Trp Ser Lys Thr He Ser Ala Val 

220 225 230 235 



ttg get tea get tgt atg atg get ctg tgt ttc caa cag tgc gga tgg 
Leu Ala Ser Ala Cys Met Met Ala Leu Cys Phe Gin Gin Cys Gly Trp 
240 245 . 250 



1071 



eta tec cat gat ttt etc cac aat cag gtg ttt gag aca cgc tgg ctt 1119 
Leu Ser His Asp Phe Leu His Asn Gin Val Phe Glu Thr Arg Trp Leu 
255 260 265 



aat gaa gtt gtc ggg tat gtg ate ggc aac gec gtt ctg ggg ttt agt 1167 
Asn Glu Val Val Gly Tyr Val He Gly Asn Ala Val Leu Gly Phe Ser 
270 275 280 



aca ggg tgg tgg aag gag aag cat aac ctt cat cat get get cca aat 1215 
Thr Gly Trp Trp Lys Glu Lys His Asn Leu His His Ala Ala Pro Asn 
285 290 295 



gaa tgc gat cag act tac caa cca att gat gaa gat att gat act etc 
Glu Cys Asp Gin Thr Tyr Gin. Pro He Asp Glu Asp He Asp Thr Leu 



1263 
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300 305 310 315 

ccc etc att* gec tgg age aag gac ata ctg gec aca gtt gag aat aag 1311 

Pro Leu lie Ala Trp Ser Lys Asp lie Leu Ala Thr Val Glu Asn Lys 
320 325 330 



1359 



1407 



aca ttc ttg cga ate etc caa tac cag cat ctg ttc ttc atg ggt ctg 
Thr Phe Leu Arg lie Leu Gin Tyr Gin His Leu Phe Phe Met Gly Leu 
335 340 345 

tta ttt ttc gee cgt ggt agt tgg etc ttt tgg age tgg aga tat ace 
Leu Phe Phe Ala Arg Gly Ser Trp Leu Phe Trp Ser Trp Arg Tyr Thr 
350 355 360 

tct aca gca gtg etc tea cct gtc gac agg ttg ttg gag aag gga act 1455 
Ser Thr Ala Val Leu Ser Pro Val Asp Arg Leu Leu Glu Lys Gly Thr 
365 370 375 

gtt ctg ttt cac tac ttt tgg ttc gtc ggg aca gcg tgc tat ctt etc 1503 
Val Leu Phe His Tyr Phe Trp Phe Val Gly Thr Ala Cys Tyr Leu Leu 
380 385 390 395 

cct ggt tgg aag cca tta gta tgg atg gcg gtg act gag etc atg tec 1551 
Pro Gly Trp Lys Pro Leu Val Trp Met Ala Val Thr Glu Leu Met Ser 
400 405 410 

ggc atg ctg ctg ggc ttt gta ttt gta ctt age cac aat ggg atg gag 1599 
Gly Met Leu Leu Gly Phe Val Phe Val Leu Ser His Asn Gly Met Glu 
415 420 425 

gtt tat aat teg tct aaa gaa ttc gtg agt gca cag ate gta tec aca 1647 
Val Tyr Asn Ser Ser Lys Glu Phe Val Ser Ala Gin He Val Ser Thr 
430 435 440 

egg gat ate aaa gga aac ata ttc aac gac tgg ttc act ggt ggc ctt 1695 
Arg Asp He Lys Gly Asn He Phe Asn Asp Trp Phe Thr Gly Gly Leu 
445 450 455 

aac agg caa ata gag cat cat ctt ttc cca aca atg ccc agg cat aat 1743 
Asn Arg Gin He Glu His His Leu Phe Pro Thr Met Pro Arg His Asn 
460 465 470 475 

tta aac aaa ata gca cct aga gtg gag gtg ttc tgt aag aaa cac ggt 1791 
Leu Asn Lys He Ala Pro Arg Val Glu Val Phe Cys Lys Lys His Gly 
480 485 490 

ctg gtg tac gaa gac gta tct att get ace ggc act tgc aag gtt ttg 1839 
Leu Val Tyr Glu Asp Val Ser He Ala Thr Gly Thr Cys Lys Val Leu 
495 500 505 

aaa gca ttg aag gaa gtc gcg gag get gcg gca gag cag cat get ace 1887 
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Lys Ala Leu Lys Glu Val Ala Glu Ala Ala Ala Glu Gin His Ala Thr 
510 515 520 

acc agt taa cagtctttgg aaagcttggc aattgatctt tattctccac 193 6 

Thr Ser 
525 

ggcagttgct tgtttgtttt ggggtgaatg accgaatgta ctggcatcca ttcttctgta 1996 
gccatcaatt ttgaac 2 012 



<210> 2 
<211> 525 
<212> PRT 

<213> Physcomitrella patens 
<400> 2 

Met Val Phe Ala Gly Gly Gly Leu Gin Gin Gly Ser Leu Glu Glu Asn 
15 10 15 

lie Asp Val Glu His lie Ala Ser Met Ser Leu Phe Ser Asp Phe Phe 
20 25 30 

Ser Tyr Val Ser Ser Thr Val Gly Ser Trp Ser Val His Ser lie Gin 
35 40 45 

Pro Leu Lys Arg Leu Thr Ser Lys Lys Arg Val Ser Glu Ser Ala Ala 
50 55 60 

Val Gin Cys He Ser Ala Glu Val Gin Arg Asn Ser Ser Thr Gin Gly 
65 70 75 80 

Thr Ala Glu Ala Leu Ala Glu Ser Val Val Lys Pro Thr Arg Arg Arg 

85 90 95 

Ser Ser Gin Trp Lys Lys Ser Thr His Pro Leu Ser Glu Val Ala Val 
100 105 110 

His Asn Lys Pro Ser Asp Cys Trp He Val Val Lys Asn Lys Val Tyr 
115 120 125 

Asp Val Ser Asn Phe Ala Asp Glu His Pro Gly Gly Ser Val He Ser 
130 135 140 

Thr Tyr Phe Gly Arg Asp Gly Thr Asp Val Phe Ser Ser Phe His Ala 
145 150 155 160 



Ala Ser Thr Trp Lys He Leu Gin Asp Phe Tyr He Gly Asp Val Glu 
165 170 175 
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Arg Val Glu Pro Thr Pro Glu Leu Leu Lys Asp Phe Arg Glu Met Arg 
180 185 190 

Ala Leu Phe Leu Arg Glu Gin Leu Phe Lys Ser Ser Lys Leu Tyr Tyr 
195 200 205 

Val Met Lys Leu Leu Thr Asn Val Ala He Phe Ala Ala Ser He Ala 
210 215 220 

He He Cys Trp Ser Lys Thr He Ser Ala Val Leu Ala Ser Ala Cys 
225 230 235 240 

Met Met Ala Leu Cys Phe Gin Gin Cys Gly Trp Leu Ser His Asp Phe 
245 250 255 

Leu His Asn Gin Val phe Glu Thr Arg Trp Leu Asn Glu Val Val Gly 
260 265 270 

Tyr Val He Gly Asn Ala Val Leu Gly Phe Ser Thr Gly Trp Trp Lys 
275 280 285 

Glu Lys His Asn Leu His His Ala Ala Pro Asn Glu Cys Asp Gin Thr 
290 295 300 

Tyr Gin Pro He Asp Glu Asp He Asp Thr Leu Pro Leu He Ala Trp 
305 310 315 320 

Ser Lys Asp He Leu Ala Thr Val Glu Asn Lys Thr Phe Leu Arg He 
325 330 335 

Leu Gin Tyr Gin His Leu Phe Phe Met Gly Leu Leu Phe Phe Ala Arg 
340 345 350 

Gly Ser Trp Leu Phe Trp Ser Trp Arg Tyr Thr Ser Thr Ala Val Leu 
355 360 365 

Ser Pro Val Asp Arg Leu Leu Glu Lys Gly Thr Val Leu Phe His Tyr 
370 375 380 

Phe Trp Phe Val Gly Thr Ala Cys Tyr Leu Leu Pro Gly Trp Lys Pro 
385 390 395 400 

Leu Val Trp Met Ala Val Thr Glu Leu Met Ser Gly Met Leu Leu Gly 
405 410 415 

Phe Val Phe Val Leu Ser His Asn Gly Met Glu Val Tyr Asn Ser Ser 
420 425 430 



Lys Glu Phe Val Ser Ala Gin He Val Ser Thr Arg Asp He Lys Gly 
435 440 445 
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Asn lie Phe Asn Asp 
450 

His His Leu Phe Pro Thr Met Pro Arg His Asn Leu Asn Lys lie Ala 
465 470 475 480 

Pro Arg Val Glu Val Phe Cys Lys Lys His Gly Leu Val Tyr Glu Asp 
485 490 495 

Val Ser lie Ala Thr Gly Thr Cys Lys Val Leu Lys Ala Leu Lys Glu 
500 505 510 
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Trp Phe Thr Gly Gly Leu Asn Arg Gin lie Glu 
455 460 



Val Ala Glu Ala Ala Ala Glu Gin His Ala Thr Thr Ser 
515 520 525 
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Internationale Patentklassifikation (IPK) Oder nationale Klassifikation und iPK 
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Anmelder 
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1. Dieser Internationale vorlaufige Prufungsbericht wurde von der mit der internationalen vorlaufigen Prufung beauftragten 
Behorde erstellt und wird dem Anmelder gemaB Artikel 36 ubermittelt. 



2. Dieser BERICHT umfaGt insgesamt 6 Blatter einschlieBlich dieses Deckblatts. 

□ AuBerdem liegen dem Bericht ANLAGEN bei; dabei handelt es sich urn Blatter mit Beschreibungen, Anspruchen 
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3. Dieser Bericht enthalt Angaben zu folgenden Punkten: 

I IS Grundlage des Berichts 

II □ Prioritat 

III □ Keine Erstellung eines Gutachtens uber Neuheit, erfinderische Tatigkeit und gewerbliche Anwendbarkeit 

IV □ Mangelnde Einheitlichkeit der Erfindung 

V Begrundete Feststellung nach Artikel 35(2) hinsichtlich der Neuheit, der erfinderischen Tatigkeit und der 
gewerblichen Anwendbarkeit; Unterlagen und Erklarungen zur Stutzung dieser Feststellung 

VI □ Bestimmte angefuhrte Unterlagen 

VII □ Bestimmte Mangel der internationalen Anmeldung 

VIII H Bestimmte Bemerkungen zur internationalen Anmeldung 
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INTERNATiONALER VORLAUFIGER 

PRUFUNGSBERICHT Internationales Aktenzeichen PCT/E POO/06223 

I. Grundlage des Berichts 

1 . Hinsichtlich der Bestandteile der internationalen Anmeldung (Ersatzbiatter, die dem Anmeldeamt auf eine 
Aufforderung nach Artikel 14 hin vorgelegt warden, gelten im Rahmen dieses Berichts als "ursprunglich 
eingereicht" und sind ihm nicht beigefugt, weil sie keine Anderungen enthalten (Regeln 70. 16 und 70. 17)): 
Beschreibung, Seiten: 

1-34 ursprungliche Fassung 

Patentanspruche, Nr.: 

1-12 ursprungliche Fassung 

Sequenzprotokoll in der Beschreibung, Seiten: 

1-6, In der ursprunglich eingereichten Fassung. 



2. Hinsichtlich der Sprache: Alle vorstehend genannten Bestandteile standen der Behorde in der Sprache, in der 
die internationale Anmeldung eingereicht worden ist, zur Verfugung oder wurden in dieser eingereicht, sofern 
unter diesem Punkt nichts anderes angegeben ist. 

Die Bestandteile standen der Behorde in der Sprache: zur Verfugung bzw. wurden in dieser Sprache 
eingereicht; dabei handelt es sich um 

□ die Sprache der Obersetzung, die fur die Zwecke der internationalen Recherche eingereicht worden ist (nach 
Regel 23.1(b)). 

□ die Veroffentlichungssprache der internationalen Anmeldung (nach Regel 48.3(b)). 

□ die Sprache der Obersetzung, die fur die Zwecke der internationalen vorlaufigen Prufung eingereicht worden 
ist (nach Regel 55.2 und/oder 55.3). 

3. Hinsichtlich der in der internationalen Anmeldung offenbarten Nucleotide und/oder Aminosauresequenz ist die 
internationale vorlaufige Prufung auf der Grundlage des Sequenzprotokolls durchgefuhrt worden, das: 

E3 in der internationalen Anmeldung in schriftlicher Form enthalten ist. 

£3 zusammen mit der internationalen Anmeldung in computerlesbarer Form eingereicht worden ist. 

□ bei der Behorde nachtraglich in schriftlicher Form eingereicht worden ist. 

□ bei der Behorde nachtraglich in computerlesbarer Form eingereicht worden ist. 

□ Die Erklarung, daB das nachtraglich eingereichte schriftliche Sequenzprotokoll nicht uber den 
Offenbarungsgehalt der internationalen Anmeldung im Anmeldezeitpunkt hinausgeht, wurde vorgelegt. 

□ Die Erklarung, daB die in computerlesbarer Form erfassten Informationen dem schriftlichen 
Sequenzprotokoll entsprechen, wurde vorgelegt. 

4. Aufgrund der Anderungen sind folgende Unterlagen fortgefallen: 

□ Beschreibung, Seiten: 



Formblatt PCT/IPEA/409 (Felder l-VIII. Blatt 1) (Jul! 1 



4 




INTERNATIONALER VORLAUFIGER 

PRUFUNGSBERICHT Internationales Aktenzeichen PCT/E POO/06223 



□ Anspruche, Nr.: 

□ Zeichnungen, Blatt: 

5. □ Dieser Bericht 1st ohne Berucksichtigung (von einigen) der Anderungen erstellt worden, da diese aus den 
angegebenen Grunden nach Auffassung der Behorde uber den Offenbarungsgehalt in der ursprunglich 
eingereichten Fassung hinausgehen (Regel 70.2(c)). 

(Auf Ersatzblatter, die solche Anderungen enthalten, ist unter Punkt 1 hinzuweisen;sie sind diesem Bericht 
beizufugen). 



6. Etwaige zusatzliche Bemerkungen: 



V. Begrundete Feststellung nach Artikel 35(2) hinsichtlich der Neuheit, der erfinderischen Tatigkeit und der 
gewerblichen Anwendbarkeit; Unterlagen und Erkiarungen zur Stutzung dieser Feststellung 

1. Feststellung 

Neuheit (N) Ja: Anspruche 5,6 

Nein: Anspruche 1-4,7-12 

Erfinderische Tatigkeit (ET) Ja: Anspruche 

Nein: Anspruche 1-12 

Gewerbliche Anwendbarkeit (GA) Ja: Anspruche 1 -12 

Nein: Anspruche 



2. Unterlagen und Erkiarungen 
siehe Beiblatt 



VIII. Bestimmte Bemerkungen zur internationalen Anmeldung 

Zur Klarheit der Patentanspruche, der Beschreibung und der Zeichnungen Oder zu der Frage, ob die Anspruche 
in vollem Umfang durch die Beschreibung gestutzt werden, ist folgendes zu bemerken: 
siehe Beiblatt 



Formblatt PCT/IPEA/409 (Felder I- VIII, Blatt 2) (Juli 1998) 



IIVITERNATIONALER vorlaufiger 
PRUFUNGSBERICHT - BEIBLATT 



Ad section V.: 

1 . Auf folgende Dokumente wird in diesem Bescheid Bezug genommen: 

D1 The plant Journal 15(1), 39-48, 1998 
D2 WO-A-98/46764 
D3 WO-A-96/21022 

2. Die vorliegende Internationale Anmeldung betrifft ein Verfahren zur Herstellung 
von ungesattigten Fettsauren. Es werden transgene Organismen hergestellt 
(vorzugsweise Pflanzen, Algen oder Pilze), die aufgrund der Expression einer 
delta-6-Desaturase aus Moos einen erhohten Gehalt an Fettsauren, Olen oder 
Lipiden mit delta-6-Doppelbindungen aufweisen. Desweiteren betrifft die 
Internationale Anmeldung die fur das obige Verfahren hergestellten transgenen 
Organismen, die durch das Verfahren hergestellten Ole, Lipide oder Fettsauren, 
sowie deren Verwendung in Futtermitteln, Nahrungsmitteln, Kosmetika oder 
Pharmazeutika. 

Im Hinblick auf die im Internationalen Recherchenbericht zitierten Dokumente 
konnen nur die Anspruche 5 und 6 der vorliegenden Internationalen Anmeldung 
als neu betrachtet werden (Artikel 33(2) PCT). 

2.1 D1 offenbart die Isolierung und Klonierung einer cDNA sowie der dazu 

korrespondierenden geriomischen DNA-Sequenz aus dem Moos Physcomitrella 
patens. Das von dieser DNA kodierte Protein wurde als eine delta-6-Desaturase 
identifiziert. Durch Expression des Proteins in S.cerevisiae sowie durch Analyse 
der aus diesem transgenen Organismus gewonnenen Fettsauren konnte bestatigt 
werden, daf3 die klonierte DNA fur eine delta-6-Desaturase kodiert. Sowohl die 
Nukleotid-Sequenz als auch die Aminosaure-Sequenz der in D1 isolierten DNA 
bzw. des korrespondierenden Proteins weisen eine 1 00 %ige Identitat uber die 
gesamte Lange mit der in der vorliegenden Internationalen Anmeldung 
offenbarten Sequenz SEQ ID NO:1 bzw. SEQ ID NO:2 auf (s.D1,S.44-47, 
'Functional expression of PPDES6 in Saccharomyces cerevisiae', 
'Discussion'/Expression in S.cerevisiae', 'Lipid analysis' and Fig.1). 

Das obige Dokument ist daher neuheitsschadlich fur den Gegenstand der 




Internationales Aktenzeichen PCT/E POO/06223 



Formblatt PCT/Beiblatt/409 (Blatt 1) (EPA-April 1 



iNternationaler vorlaufiger 
prufungsbericht - beiblatt 



Internationales Aktenzeichen PCT/E POO/06223 



Anspruche 1-4 und 7-1 1 . 

2.2 D2 beschreibt eine Methode zur Herstellung von mehrfach ungesattigten 
langkettigen Fettsauren in Pflanzen. Expressionskonstrukte enthaltend DNA- 
Sequenzen kodierend fur eine delta-6-, delta-5- Oder delta-12-Desaturase wurden 
zunachst zur Herstellung dieser transgenen Pflanzen verwendet. Es wurde 
gezeigt, da(3 eine Expression dieser Desaturasen in den Pflanzen die Herstellung 
von groBen Mengen an mehrfach ungesattigten Fettsauren ermoglicht, und auf 
diese Weise zu einer Veranderung des Fettsaure-Profils dieser Pflanzen fuhrt. 
Diese Manipulation des Fettsaure-Profils erlaubt nunmehr die Herstellung von 
kommerziell nutzbaren Mengen an Pflanzenolen sowie deren Verwendung als 
Pharmazeutika, Nahrungsmittel etc. (s.D2, S.5, Zeile 27 - S.6, Zeile 17, S.8, Zeile 
19 - S.36, Zeile 27, Beispiele 6-8,13,14,16). 

In Hinblick auf D2 ist der Gegenstand der Anspruche 1 1 und 12 daher nicht neu. 

2.3 D3 offenbart die Klonierung einer DNA kodierend fur eine delta-6-Desaturase aus 
dem Cyanobakterium Synechocystis sowie einer cDNA kodierend fur eine delta-6- 
Desaturase aus Borretsch. Diese DNA-Sequenzen wurden in verschiedenen 
Organismen, wie z.B. in Tabakpflanzen, exprimiert, und es wurde gezeigt, daG in 
den transgenen Organismen mittels dieser Sequenzen ungesattigte Fettsauren, 
wie z.B. GLA, hergestellt wurden (s.D3, S.3, Zeilen 3-23, S.5. Zeile 16 - S. 19, 
Zeile 24, Beispiele 6,13,14, Anspruche 11-18). 

Das obige Dokument ist daher neuheitsschadlich fur den Gegenstand des 
Anspruches 11. 

3. Zur Beurteilung eines erfinderischen Schrittes der Anspruche 5 und 6 der 

vorliegenden International Anmeldung wird ebenfalls D1 als der nachstliegende 
Stand der Technik herangezogen. 

Diese Anspruche betreffen die Verwendung einer transgenen Alge Oder Pflanze, 
insbesondere einer Olfruchtpflanze im Verfahren zur Herstellung von 
ungesattigten Fettsauren. 

Diese abhangigen Anspruche scheinen keine zusatzlichen Merkmale zu 
enthalten, welche in Kombination mit den Merkmalen der Anspruche auf die sie 
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sich beziehen, einen erfinderischen Schritt beinhalten. Die Verwendung 
transgener Pflanzen, bzw. Olfruchtpflanzen, in einem Verfahren zur Herstellung 
von ungesattigten Fettsauren ist bereits aus D2 oder D3 bekannt. 
Der Gegenstand der Anspruche 5 und 6 beruht daher nicht auf einer nach Artikel 
33(3) PCT erforderlichen erfinderischen Tatigkeit. 

Ad section VIII.: 

1 . Den Anspruchen 1 ,4 und 7-9 mangelt es an Klarheit aufgrund der Ausdrucke 
"Organismus" und "Tiere". Die Beschreibung der vorliegenden internationalen 
Anmeldung nimmt nur Bezug auf tierische Zellen, nicht jedoch auf Tiere als 
solche. Desweiteren ist es absolut notwendig klarzustellen, daB der Mensch nicht 
unter die Begriffe "Organismus" und "Tiere" fallt. 



Formblatt PCT/Beiblatt/409 (Blatt 3) (EPA-April 1997) 



(12) NACH DEM VERTRAG UBER DIE INTERNATIONALE ZUSAMMENARBEIT AUF DEM GEBIET DES 
PATENTWESENS (PCT) VEROFFENTLICHTE INTERNATIONALE ANMELDUNG 



r 



(19) Weltorganisation fur geistiges Eigentum 
Internationales Buro 

(43) Internationales Veroffentlichungsdatum 
11. Januar 2001 (11.01.2001) 




PCT 



(10) Internationale Veroffentlichungsnummer 

WO 01/02591 Al 



(51) Internationale Patentklassifikation 7 : C12N 15/82, 

9/02, 15/53, C12P 7/64, CI 1C 3/00, A01H 5/00, 13/00, 
15/00, A23L 1/30, A23K 1/16, A61K 35/78 



(21) Internationales Aktenzeichen: 

(22) Internationales An melded a turn: 

4. 

(25) Einreichungssprache: 

(26) Veroffentlichungssprache: 



PCT/EPO0/06223 



Juli 2000 (04.07.2000) 



Deutsch 



Deutsch 



(30) Angaben zur Prioritat: 

09/347,531 6. Juli 1999 (06.07.1999) US 

100 30 976.3 30. Juni 2000 (30.06.2000) DE 

(71) Anmelder (fur alle Bestimmungsstaaten mit Ausnahme 
von US): BASF AKTIENG ESELLSCHA FT [DE/DE]; 
D-67056 Ludwigshafen (DE). 

(72) Erfinder; und 

(75) Erfinder/Anmelder (nur fiir US): HEINZ, Ernst 
[DE/DE]; Puettkampsweg 13, D-22609 Hamburg (DE). 
GIRKE, Thomas [DE/US]; 16945 Vinaruz Place, San 
Diego, CA 92128 (US). SCHEFFLER, Jodi [US/US]; 51 
Country Road 228, Oxford, MS 38655 (US). DA COSTA 
E SILVA, Oswaldo [BR/US]; 203 Littleford Lane, Apex, 
NC 27502 (US). 



(74) Gemeinsamer Vertreter: BASF AKTIENGE- 
SELLSCHAFT; 67056 Ludwigshafen (DE). 

(81) Bestimmungsstaaten (national): AE, AG, AL, AM, AT, 
AU, AZ, BA, BB, BG, BR, BY, BZ, CA, CH, CN, CR, CU, 
CZ, DE, DK, DM, DZ, EE, ES, FI, GB, GD, GE, GH, GM, 
HR, HU, ID, IL, IN, IS, JP, KE, KG, KP, KR, KZ, LC, LK, 
LR, LS, LT, LU, LV, MA, MD, MG, MK, MN, MW, MX, 
MZ, NO, NZ, PL, PT, RO, RU, SD, SE, SG, SI, SK, SL, 
TJ, TM, TR, TT, TZ, UA, UG, US, UZ, VN, YU, ZA, Zw' 

(84) Bestimmungsstaaten (regional): ARIPO-Patent (GH, 
GM, KE, LS, MW, MZ, SD, SL, SZ, TZ, UG, ZW), eura- 
sisches Patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), 
europaisches Patent (AT, BE, CH, CY, DE, DK, ES, FI, 
FR, GB, GR, IE, IT, LU, MC, NL, PT, SE), OAPI-Patent 
(BF, BJ, CF, CG, CI, CM, GA, GN, GW, ML, MR, NE, 
SN, TD, TG). 

Veroffentlicht: 

— Mit internationalem Recherchenbericht. 

— Vor Ablauf der fiir Anderungen der Anspruche geltenden 
Frist; Verqffentlichung wird wiederholt. falls Anderungen 
eintreffen. 

Zur ErJdarung der Zweibuchstaben-Codes, und der anderen 
Abkurzungen wird auf die Erklarungen ("Giudance Notes on 
Codes and Abbreviations") am Anfang jeder regularen Ausgabe 
der PCT-Gazette verwiesen. 



— (54) Title: PLANTS EXPRESSING A 6-DES ATURASE GENES AND OILS FROM THESE PLANTS CONTAINING PUFAS 
= AND METHOD FOR PRODUCING UNSATURATED FATTY ACIDS 

= (54) Bezeichnung: A 6-DES ATURASEGENE EXPRIMIERENDE PFLANZEN UND PUFAS ENTHALTENDE OLE AUS DIE 
SEN PFLANZEN UND EIN VERFAHREN ZUR HERSTELLUNG UNGESATTIGTER FETTSAUREN 

fH (57) Abstract: ^ invention relates to an improved method for producing unsaturated fatty acids and to a method for producing 
^ tnglycendes with an increased unsaturated fatty acid content. The invention also relates to the production of a transgenic organism 
preferably a transgenic plant or a transgenic micro-organism, containing increased quantities of unsaturated fatty acids, oils or lipids' 
^ with A6-double bonds as a result of the expression of a A-6-desaturase, from moss. The invention also relates to transgenic organ- 
s' isms containing a A6-desaturase gene, and to the use of the unsaturated fatty acids or triglycerides with an increased unsaturated 
ID fatty acid content produced in the method. 

(57) Zusammenfas s«ng: Die vorliegende Erfindung betrifft ein verbessertes Verfahren zur Herstellung von ungesattigten Fettsau- 
O ™ S ^ e e ^ Verfahre n zur Herstellung von Triglyceriden mit einem erhohten Gehalt an ungesattigten Fettsauren. Die Erfindung 

Obemm <he HersteUung ernes transgenen Organismuses bevorzugt einer transgenen Pflanze oder eines transgenen Mikroorganismus 
- T A ? Chalt ** FettsSuren ' ° len Oder Lipiden mit A6-Doppelbindungen aufgrund der Expression einer A-6-Desaturase aus 
^ Moos Ausserdem betnfft die Erfindung transgene Organismen, die ein A6-Desaturasegen enthalten, sowie die Verwendung der im 
^ Verfahren hergestellten ungesattigten Fettsauren bzw. Triglyceride mit einem erhohten Gehalt an ungesattigten Fettsauren 
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A6-Desaturasegene exprimierende Pflanzen und PUFAS enthaltende 
Ole aus diesen Pflanzen und ein Verfahren zur Herstellung 
ungesattigter Fettsauren 

5 

Beschreibung 

Die vorliegende Erfindung betrifft ein verbessertes Verfahren 
zur Herstellung von ungesattigten Fettsauren sowie ein Verfahren 

10 zur Herstellung von Triglyceriden mit einem erhohten Gehalt an 
ungesattigten Fettsauren. Die Erfindung betrifft die Herstellung 
eines transgenen Organismusses bevorzugt einer transgenen Pflanze 
oder eines transgenen Mikroorganismus mit erhdhtem Gehalt an 
Fettsauren, Olen oder Lipiden mit A6-Doppelbindungen aufgrund 

15 der Expression einer A-6-Desaturase aus Moos. 

AuSerdem betrifft die Erfindung transgene Organismen, die ein 
A6-Desaturasegen enthalten, sowie die Verwendung der im Verfahren 
hergestellten ungesattigten Fettsauren bzw. Triglyceride mit 
20 einem erhohten Gehalt an ungesattigten Fettsauren. 

Fettsauren und Triglyceride haben eine Vielzahl von Anwendungen 
in der Lebensmittelindustrie, der Tierernahrung, der Kosmetik und 
im Pharmabereich. Je nachdem ob es sich um freie gesattigte oder 

25 ungesattigte Fettsauren oder um Triglyceride mit einem erhohten 
Gehalt an gesattigten oder ungesattigten Fettsauren handelt, sind 
sie fur die unterschiedlichsten Anwendungen geeignet, so werden 
beispielsweise mehrfach ungesattigte Fettsauren Babynahrung zur 
Erhohung des Nahrwertes zugesetzt. Hauptsachlich werden die ver- 

30 schiedenen Fettsauren und Triglyceride aus Mikroorganismen wie 
Mortierella oder aus Ol-produzierenden Pflanzen wie Soja, Raps, 
Sonnenblume und weiteren gewonnen, wobei sie in der Regel in Form 
ihrer Triacylglyceride anf alien. Sie konnen aber auch aus Tieren 
wie Fischen gewonnen werden. Die freien Fettsauren werden vor- 

35 teilhaft durch Verseifung hergestellt. 

Je nach Anwendungszweck sind Ole mit gesattigten oder ungesattig- 
ten Fettsauren bevorzugt, so sind z.B. in der humanen Ernahrung 
Lipide mit ungesattigten Fettsauren speziell mehrfach ungesattig- 

40 ten Fettsauren bevorzugt, da sie einen positiven EinfluS auf den 
Cholesterinspiegel im Blut und damit auf die Moglichkeit einer 
Herzerkrankung haben. Auch eine positive Wirkung auf die Carcino- 
genese wird den ungesattigten Fettsauren zugeschrieben. Sie 
sind auSerdem wichtige Ausgangsstof f e fur die Synthese von 

45 Verbindungen, die wichtige biologische Vorgange innerhalb des 




WO 01/02591 PCT7EP00/06223 

2 

Organismus steuern. Sie finden deshalb in verschiedenen 
diatischen Lebensmitteln oder Medikamenten Anwendung. 

Aufgrund ihrer positiven Eigenschaf ten hat es in der Vergangen- 
5 heit nicht an Ansatzen gefehlt, Gene, die an der Synthese 

von Fettsauren bzw. Triglyceriden beteiligt sind, fur die Her- 
stellung von Olen in verschiedenen Organismen mit geandertem 
Gehalt an ungesattigten Fettsauren verfugbar zu machen. So 
wird in WO 91/13972 und seinem US-Aquivalent eine A9-Desaturase 

10 beschrieben. In WO 93/11245 wird eine A15-Desaturase in 

WO 94/11516 wird eine A12-Desaturase beansprucht . A6-Desaturasen 
werden in Girke et al . (The Plant Journal, 15, 1998: 39-48), 
Napier et al . (Biochem. J., 330, 1998: 611-614), Murata et al. 
(Biosynthesis of y-linolenic acid in cyanobacterium Spirulina 

15 patensis, pp 22-32, In: y-linolenic acid, metabolism an its roles 
in nutrition and medicine, Huang, Y. and Milles, D.E. teds.], AOC 
Press, Champaign, Illinois), Sayanova et al . (Proc. Natl. Acad. 
Sci. USA, 94, 1997: 4211-4216), WO 98/46764, Cho et al . (J. Biol. 
Chein., 274, 1999: 471-477), Aki et al . (Biochem. Biophys . Res. 

20 Commun., 255, 1999: 575-579), und Reddy et al . (Plant Mol . Biol., 
27, 1993: 293-300) beschrieben. Weitere Desaturasen werden 
beispielsweise in EP-A-0 550 162, WO 94/18337, WO 97/30582, 
WO 97/21340, WO 95/18222, EP-A-0 794 250, Stukey et al . , J. Biol. 
Chem., 265, 1990: 20144-20149, Wada et al . , Nature 347, 1990: 

25 200-203 oder Huang et al . , Lipids 34, 1999: 649-659 beschrieben. 
Weitere A6-Desaturasen werden in WO 93/06712, US 5,614,393, 
US5, 614,393, WO 96/21022, WO00/21557 und WO 99/27111 beschrieben. 
Die biochemische Charakterisierung der verschiedenen Desaturasen 
ist jedoch bisher nur unzureichend erfolgt, da die Enzyme als 

30 membrangebundene Proteine nur sehr schwer zu isolieren und 
charakterisieren sind (McKeon et al., Methods in Enzymol . 71, 
1981: 12141-12147, Wang et al., Plant Physiol. Biochem., 26, 
1988: 777-792). In der Regel erfolgt die Charakterisierung 
membrangebundene r Desaturasen durch Einbringung in einen 

35 geeigneten Organismus, der anschlieEend auf Enzymaktivitat 
mittels Edukt- und Produktanalyse untersucht wird. Die An- 
wendung zur Produktion in trans genen Organismen beschrieben 
wie in WO 98/46763 W098/46764, W098/46765. Dabei wird auch die 
Expression verschiedener Desaturasen wie in W099/64616 oder 

40 W098/46776 und Bildung polyungesattigter Fettsauren beschrieben 
und beansprucht. Beziiglich der Effektivitat der Expression von 
Desaturasen und ihren Einf lu£ auf die Bildung polyungesattigter 
Fettsauren ist anzumerken, daS durch Expression einer einzelnen 
Desaturase wie im vorgenannten Stand der Technik beschrieben 

45 lediglich geringe Gehalte an ungesattigten Fettsauren beispiels- 
weise an A-6 ungesattigten Fettsauren/Lipiden wie z.B. y-Linolen- 
saure erreicht wurden und werden. 
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Nach wie vor besteht daher ein grofier Bedarf an neuen und besser 
geeigneten Genen, die ftir Enzyme codieren, die an der Biosynthese 
ungesattigter Fettsauren beteiligt sind und es ermoglichen, diese 
in einem technischen MaSstab herzustellen. Weiterhin besteht 
5 nach wie vor ein Bedarf an verbesserten Verfahren zur Gewinnung 
moglichst hoher Gehalte an polyungesattigten Fettsauren. 

Es bestand daher die Aufgabe ein Verfahren zur Herstellung 
von ungesattigten Fettsauren unter Verwendung von Genen, die 

10 beispielsweise fur Desaturase-Enzyme codieren und die an der 
Synthese mehrfach ungesattigter Fettsauren in den Samen einer 
Olsaat beteiligt sind, bereitzustellen und so den Gehalt poly- 
ungesattigter Fettsauren zu erhohen. Diese Aufgabe wurde durch 
ein Verfahren zur Herstellung von ungesattigten Fettsauren 

15 gelost, dadurch gekennzeichnet , daS mindestens eine isoiierte 
Nukleinsauresequenz , die fur ein Polypeptid mit A6-Desaturase- 
aktivitat codiert, ausgewahlt aus der Gruppe: 

a) einer Nukleinsauresequenz mit der in SEQ ID NO: 1 dar- 
20 gesteliten Sequenz , 

b) Nukleinsauresequenzen, die sich als Ergebnis des degenerier- 
ten genetischen Codes von der in SEQ ID NO: 1 ableiten 

25 c) Derivate der in SEQ ID NO: 1 dargestellten Nukleinsaure- 
sequenz, die fur Polypeptide mit der in SEQ ID NO: 2 
dargestellten Aminosauresequenzen codieren und mindestens 
50 % Homologie auf Aminosaureebene aufweisen, ohne daS die 
enzymatische Wirkung der Polypeptide wesentlich reduziert 

30 ist, 

in einen Organismus eingebracht wird, dieser Organismus angezogen 
wird, wobei der angezogene Organismus mindestens 1 Mol-% unge- 
sattigte Fettsauren bezogen auf den gesamten Fettsauregehalt 
35 im Organismus enthalt. 

Unter Anzucht des Organismus ist die Kultivierung von Pflanzen 
ebenso zu verstehen wie die Anzucht von eukaryontischen oder 
prokaryontischen Mikroorganismen wie Bakterien, Hefen, Pilzen, 
40 Ciliaten, Algen, Cyanobakterien, tierischen oder pflanzlichen 
Zellen oder Zellverbanden oder die Anzucht von Tieren. 

Die in den im erf indungsgemaSen Verfahren gewonnenen Organismen 
enthalten in der Regel ungesattigte Fettsauren in Form von 
45 gebundenen Fettsauren, das heiSt die ungesattigten Fettsauren 
liegen iiberwiegend in Form ihrer Mono-, Di- oder Triglyceride, 
Glycolipide, Lipoproteine oder Phospholipide wie Ole oder Lipide 
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oder sonstig als Ester oder Amide gebundenen Fettsauren vor. 
Auch freie Fettsauren sind in den Organismen in Form der freien 
Fettsauren oder in Form ihrer Salze enthalten. Die freien oder 
gebundenen ungesattigten Fettsauren enthalten vorteilhaft gegen- 
5 liber den Ausgangsorganismen einen erhohten Gehalt an Fettsauren 
mit A6-Doppelbindungen wie vorteilhaft y-Linolensaure . Die durch 
Anzucht im erf indungsgemafien Verfahren gewonnenen Organismen 
und die in ihnen enthaltenen ungesattigten Fettsauren konnen 
direkt beispielsweise zur Herstellung von pharmazeutischen 

10 Zubereitungen, von Agrochemikalien, Futtermitteln oder Lebens- 
mitteln verwendet werden oder aber nach Isolierung aus den 
Organismen. Dabei konnen alle Stufen der Aufreinigung der unge- 
sattigten Fettsauren verwendet werden, das hei£t von Rohextrakten 
der Fettsauren bis zu vollstandig gereinigten Fettsauren sind fur 

15 die Herstellung der vorgenannten Produkte geeignet . In einer vor- 
teilhaften Ausf uhrungsf orm konnen die gebundenen Fettsauren aus 
beispielsweise den Olen bzw. Lipiden beispielsweise uber eine 
basische Hydrolyse z.B. mit NaOH oder KOH freigesetzt werden. 
Diese freien Fettsauren konnen direkt im erhaltenen Gemisch oder 

20 nach weiterer Aufreinigung zur Herstellung von pharmazeutischen 
Zubereitungen, von Agrochemikalien, Futtermitteln oder Lebens- 
mitteln verwendet werden. Auch konnen die gebundenen oder freien 
Fettsauren zur Umesterung oder Veresterung beispielsweise mit 
anderen Mono-, Di- oder Triglyceriden oder Glycerin verwendet 

25 werden, vim den Anteil an ungesattigten Fettsauren in diesen 
Verbindungen beispielsweise in den Triglyceriden zu erhohen. 

Ein weiterer erf indungsgemaSer Gegenstand ist ein Verfahren zur 
Herstellung von Triglyceriden mit einem erhohten Gehalt an unge- 

30 sattigten Fettsauren, indem man Triglyceride mit gesattigten oder 
ungesattigten oder gesattigten und ungesattigten Fettsauren mit 
mindestens einem der Protein, das durch die Sequenz SEQ ID NO: 2 
codiert wird, inkubiert. Vorteilhaft wird das Verfahren in Gegen- 
wart von Verbindungen durchgefiihrt , die Reduktionsaquivalente 

35 aufnehmen oder abgeben konnen. AnschlieSend konnen die Fettsauren 
aus den Triglyceriden freigesetzt werden. 

Die oben genannten Verfahren ermoglichen vorteilhaft die Synthese 
von Fettsauren oder gebundenen Fettsauren wie Triglyceriden mit 
40 einem erhohten Gehalt an Fettsauren mit A6-Doppelbindungen. 

Als Organismen fur die genannten Verfahren seien beispielhaft 
Pflanzen wie Arabidopsis, Gerste, Weizen, Roggen, Hafer, Mais, 
Soja, Reis, Baumwolle, Zuckerrxibe, Tee, Karotte, Paprika, Canola, 
45 Sonnenblume, Flachs , Hanf, Kartoffel, Triticale, Tabak, Tomate, 
Raps, Kaffee, Tapioka, Maniok, Pfeilwurz, Tagetes, Alfalfa, 
Erdnu£, Rizinus, Kokosnu£, Olpalme, Farbersaflor (Carthamus 
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tinctorius), Salat und den verschiedenen Baum-, Nu£- und Wein- 
spezies, oder Kakaobohne, Mikroorganismen wie Pilze Mortierella, 
Saprolegnia oder Pythium, Bakterien wie die Gattung Escherichia, 
Cyanobakterien, Algen oder Protozoen wie Dinof lagellaten wie 
5 Crypthecodinium genannt . Bevorzugt werden Organismen, die 

natiirlicherweise Ole in groSeren Mengen synthetisieren konnen 
wie Mikroorganismen wie Pilze wie Mortierella alpina, Pythium 
insidiosum oder Pflanzen wie Soja, Raps, KokosnuS, Olpalme, 
Canola, Farbersaflor (Carthamus tinctorius) , Rizinus, Calendula, 
10 Lein, Borretsch, Erdnu£, Kakaobohne oder Sonnenblume, besonders 
bevorzugt werden Soja, Raps oder Sonneriblume . 

Die in den Verfahren verwendeten Organismen werden je nach 
Wirtsorganismus in dem Fachmann bekannter Weise angezogen bzw. 

15 gezlichtet. Mikroorganismen wie Bakterien, Pilze, Ciliaten, 

pflanzliche oder tierische Zellen werden in der Regei in einem 
fliissigen Medium, das eine Kohlenstoff quelle meist in Form von 
Zuckern, eine Sticks toff quelle meist in Form von organischen 
Stickstof fquellen wie Hefeextrakt oder Salzen wie Ammoniumsulf at , 

20 Spurenelemente wie Eisen-, Mangan-, Magnesiumsalze und gegebenen- 
falls Vitamine enthalt, bei Temperaturen zwischen 0°C und 100°C, 
bevorzugt zwischen 10°C bis 60°C unter je nach Organismus Sauer- 
stof fbegasung oder in Abwesenheit von Sauerstoff angezogen. Dabei 
kann der pH der Nahrf lussigkeit auf einen festen Wert gehalten 

25 werden, das hei£t der pH wird wahrend der Anzucht reguliert 
oder der pH wird nicht reguliert und verandert sich wahrend 
der Anzucht. Die Anzucht kann batch weise, semi batch weise 
oder kontinuierlich erfolgen. Nahrstoffe konnen zu beginn der 
Fermentation vorgelegt oder semikontinuierlich oder kontinuier- 

30 lich nach gefuttert werden. Auch eine Anzucht auf festen Medien 
ist moglich. 

Pflanzen werden nach Transformation in der Regel zunachst 
regeneriert und anschlieSend wie ublich angezogen bzw. angebaut. 
35 Dies kann im Gewachshaus oder im Freiland erfolgen. 

Aus den Organismen werden nach Anzucht die Lipide in tiblicher- 
weise gewonnen. Hierzu konnen die Organismen nach Ernte zunachst 
aufgeschlossen werden oder direkt verwendet werden. Die Lipide 

40 werden vorteilhaft mit geeigneten Losungsmitteln wie apolare 

Losungsmittel wie Hexan oder Ethanol, Isopropanol oder Gemischen 
wie Hexan/ Isopropanol, Phenol /Chloroform/ Isoamylalkohol bei 
Temperaturen zwischen 0°C bis 80°C, bevorzugt zwischen 20°C bis 
50°C extrahiert. Die Biomasse wird in der Regel mit einem Uber- 

45 schu£ an Losxingsmittel extrahiert beispielsweise einem UberschuS 
von Losungsmittel zu Biomasse von 1:4. Das Losungsmittel wird 
anschlieSend beispielsweise uber eine Destination entf ernt . 
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Die Extraktion kann auch mit superkritischem C0 2 erfolgen. 
Nach Extraktion kann die restliche Biomasse beispielsweise 
iiber Filtration entfernt werden. 

5 Das so gewonnene Rohol kann anschlie£end weiter aufgereinigt 
werden, beispielsweise in deiti Trubungen uber das Versetzen 
mit polaren Losungsmittel wie Aceton oder Chloroform und 
anschlieSender Filtration oder Zentrif ligation entfernt werden. 
Auch eine weitere Reinigung iiber chromatographische Verfahren, 
10 Destination oder Kristallisation ist moglich. 

Zur Gewinnung der f reien Fettsauren aus den Triglyceriden werden 
diese in ublicher Weise, wie oben beschrieben, verseift. 

15 Ein weiterer Gegenstand der Erfindung sind ungesattigte Fett- 
sauren sowie Trigylceride mit einem erhohten Gehalt an unge- 
sattigten Fettsauren, die nach den oben genannten Verfahren 
hergestellt wurden, sowie deren Verwendung zur Herstellung von 
Nahrungsmitteln, Tier f utter, Kosmetika oder Pharmazeutika . Hierzu 

20 werden diese den Nahrungsmitteln, dem Tierf utter, den Kosmetika 
oder Pharmazeutika in ublichen Mengen zugesetzt . 

Im erf indungsgema£en Verfahren wurden durch Expression einer 
A6-Desaturase aus Moos in Organismen wie Pilze, Bakterien, 

25 Tieren oder Pflanzen, bevorzugt Pilzen, Bakterien und Pflanzen, 
besonders bevorzugt in Pflanzen, ganz besonders bevorzugt in 
Olfruchtpf lanzen wie Raps, Canola, Lein, Soja, Sonnenblume, 
Borretsch, Rizinus, Olpalme, Farbersaflor (Carthamus tinctorius) , 
KokosnuE, ErdnuS oder Kakaobohne hohere Gehalte an ungesattigten 

30 Fettsauren wie y-Linolensaure erhalten. Auch die Expression in 
Feldfriichten, wie Mais, Weizen, Roggen, Hafer, Triticale, Reis , 
Gerste, Alfalfa, oder Buschpf lanzen (Kaffee, Kakao, Tee) ist 
vorteilhaft. Durch die Expression eines Gens, das fur eine 
A-6-Desaturase aus Moos codiert, in den oben genannten Organismen 

35 konnen Gehalte an ungesattigten Fettsauren in den Organismen von 
mindestens 1 Mol-%, bevorzugt mindestens 3 Mol-%, besonders be- 
vorzugt mindestens 4 Mol-%, ganz besonders bevorzugt mindestens 
5 Mol-% erreicht werden. 

40 Unter Derivate(n) sind beispielsweise funktionelle Homologe 
der von SEQ ID NO: 1 codierten Enzyme oder deren enzymatischer 
Aktivitat, das heiSt Enzyme, die dieselben enzymatischen 
Reaktionen wie die von SEQ ID NO: 1 katalysieren, zu verstehen. 
Diese Gene ermoglichen ebenfalls eine vorteilhafte Herstellung 

45 von ungesattigten Fettsauren mit Doppelbindungen in A6-Position. 
Unter ungesattigten Fettsauren sind im folgenden doppelt oder 
mehrfach ungesattigte Fettsauren, die Doppelbindungen aufweisen, 
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zu verstehen. Die Doppelbindungen konnen konjugiert Oder nicht 
konjugiert sein. Die in SEQ ID NO: 1 genannte Sequenz codiert 
fur ein Enzym, das eine A6-Desaturase-Aktivitat aufweist. 

5 Das erf indungsgemaSe Enzym A6-Desaturase ftihrt vorteilhaft in 
Fettsaurereste von Glycerolipiden eine cis-Doppelbindung in 
Position C 6 -C 7 ein (siehe SEQ ID NO: 1) . Das Enzym hat auSerdem 
eine A6-Desaturase-Aktivitat , die vorteilhaft in Fettsaurereste 
von Glycerolipiden ausschlieSlich eine cis-Doppelbindung in 
10 Position C6-C7 einfiihrt. Diese Aktivitat hat auch das Enzym mit 
der in SEQ ID NO: 1 genannten Sequenz, bei dem es sich urn eine 
monofunktionelle A6-Desaturase handelt. 

Die im erf indungsgema&en Verfahren verwendeten Nukleinsaure- 
15 sequenz (en) (fur die Anmeldung soil der singular den plural 

umfassen und umgekehrt) oder Fragmente davon konnen vorteilhaft 
zur Isolierung weiterer genomischer Sequenzen liber Homologie- 
screening verwendet werden . 

20 Die genannten Derivate lassen sich beispielsweise aus anderen 
Organismen eukaryontischen Organismen wie Pflanzen wie speziell 
Moosen, Dinof lagellaten oder Pilze isolieren. 

Weiterhin sind unter Derivaten bzw. funktionellen Derivaten der 

25 in SEQ ID NO : 1 genannten Sequenz beispielsweise Allelvarianten 
zu verstehen, die mindestens 50 % Homologie auf der abgeleiteten 
Aminosaureebene , vorteilhaft mindestens 70 % Homologie, bevorzugt 
mindestens 80 % Homologie, besonders bevorzugt mindestens 85 % 
Homologie, ganz besonders bevorzugt 90 % Homologie aufweisen. 

30 Die Homologie wurde uber den gesamten Aminosaurebereich be- 

rechnet. Es wurde das Programm PileUp, BESTFIT, GAP, TRANSLATE 
bzw. BACKTRANS LATE (= Bestandteil des Programmpaketes UWGCG, 
Wisconsin Package, Version 10.0-UNIX, January 1999, Genetics 
Computer Group, Inc., Deverux et al . , Nucleic. Acid Res., 12, 

35 1984: 387-395) verwendet (J. Mol. Evolution., 25, 351-360, 1987, 
Higgins et al . , CABIOS, 5 1989: 151-153). Die von den genannten 
Nukleinsauren abgeleitete Aminosauresequenz ist Sequenz 
SEQ ID NO: 2 zu entnehmen. Unter Homologie ist Identitat zu 
verstehen, das hei£t die Aminosauresequenzen sind zu mindestens 

40 50 % identisch. Die erf indungsgema&en Sequenzen sind auf Nuklein- 
saureebene mindestens 65 % homolog, bevorzugt mindestens 70 %, 
besonders bevorzugt 75 %, ganz besonders bevorzugt mindestens 



80 %. 



45 Allelvarianten umfassen insbesondere funktionelle Varianten, die 
durch Deletion, Insertion oder Substitution von Nukleotiden aus 
der in SEQ ID NO: 1 dargestellten Sequenz erhaltlich sind, wobei 
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die enzymatische Aktivitat der abgeleiteten synthetisierten 
Proteine erhalten bleibt. 



Solche DNA-Sequenzen lass en sich ausgehend von der in 
5 SEQ ID NO: 1 beschriebenen DNA-Sequenz oder Teilen dieser 

Sequenzen, beispielsweise mit liblichen Hybridisierungsverf ahren 
oder der PCR-Technik aus anderen Eukaryonten wie beispielsweise 
den oben genannt isolieren. Diese DNA-Seqiienzen hybridisieren 
unter Standardbedingungen mit den genannten Sequenzen. Zur 

10 Hybridisierung werden vorteilhaft kurze Oligonukleotide bei- 
spielsweise der konservierten Bereiche, die iiber Vergleiche mit 
anderen Desaturasegenen in dem Fachmann bekannter Weise ermittelt 
werden konnen, verwendet. Vorteilhaft werden die Histidin-Box- 
Sequenzen verwendet . Es konnen aber auch langere Fragmente der 

15 erfindungsgema&en Nukleinsauren oder die vollstandigen Sequenzen 
fur die Hybridisierung verwendet werden. Je nach der verwendeten 
Nukleinsaure: Oligonukleotid, langeres Fragment oder vollstandige 
Sequenz oder je nachdem welche Nukleinsaureart DNA oder RNA fiir 
die Hybridisierung verwendet werden, variieren diese Standard- 

20 bedingungen. So liegen beispielsweise die Schmelztemperaturen fur 
DNA : DNA-Hybr ide ca . 10°C niedriger als die von DNA: RNA-Hybriden 
gleicher Lange . 



Unter Standardbedingungen sind beispielsweise je nach Nuklein- 

25 saure Temperaturen zwischen 42 und 58°C In einer waSrigen Puffer- 
losung mit einer Konzentration zwischen 0,1 bis 5 x SSC (1 X SSC 
= 0,15 M NaCl, 15 mM Natriumcitrat , pH 7,2) oder zusatzlich in 
Gegenwart von 5 0 % Formamid wie beispielsweise 42 °C in 5 x SSC, 
50 % Formamid zu verstehen. Vorteilhaf terweise liegen die 

30 Hybridisierungsbedingungen fur DNA : DNA-Hybr ide bei 0,1 x SSC und 
Temperaturen zwischen etwa 20°C bis 45°C, bevorzugt zwischen etwa 
30°C bis 45°C. Fur DNArRNA-Hybride liegen die Hybridisierungs- 
bedingungen vorteilhaft bei 0,1 x SSC und Temperaturen zwischen 
etwa 30°C bis 55°C, bevorzugt zwischen etwa 45°C bis 55°C. Diese 

35 angegebenen Temperaturen fiir die Hybridisierung sind beispielhaft 
kalkulierte Schmelz temper aturwerte fur eine Nukleinsaure mit 
einer Lange von ca, 100 Nukleotiden und einem G + C-Gehalt von 
50 % in Abwesenheit von Formamid. Die experimentellen Bedingungen 
fiir die DNA-Hybr idisierung sind in einschlag-igen Lehrrbiichern der 

40 Genetik wie beispielsweise Sambrook et al . , w Molecular Cloning", 
Cold Spring Harbor Laboratory, 1989, beschrieben und lassen sich 
nach dem Fachmann bekannten Formeln beispielsweise abhangig von 
der Lange der Nukleinsauren, der Art der Hybride oder dem G + 
C-Gehalt berechnen . Weitere Inf onaationen zur Hybridisierung kann 

45 der Fachmann folgenden Lehrbiichem entnehinen: Ausubel et al. 

(eds), 1985, Current Protocols in Molecular Biology, John Wiley & 
Sons, New York; Hames and Higgins (eds), 1985, Nucleic Acids 
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Hybridization: A Practical Approach, IRL Press at Oxford 
University Press, Oxford; Brown (ed) , 1991, Essential Molecular 
Biology: A Practical Approach, IRL Press at Oxford University 
Press, Oxford. 

5 

Weiterhin sind unter Derivaten Homologe der Sequenz SEQ ID No: 1 
beispielsweise eukaryontische Homologe, verkurzte Sequenzen, Ein- 
zelstrang-DNA der codierenden und nichtcodierenden DNA-Sequenz 
oder RNA der codierenden und nichtcodierenden DNA-Sequenz zu 
10 verstehen. 

AuSerdem sind unter Homologen der Sequenz SEQ ID NO: 1 Derivate 
wie beispielsweise Prornotorvarianten zu verstehen. Diese Varian- 
ten konnen durch ein oder mehrere Nukleotidaustausche , durch 
15 Insertion (en) und/ oder Deletion (en) verandert sein, ohne da£ aber 
die Funktionalitat bzw. Wirksamkeit der Promotoren beeintrachtigt 
sind. Des weiteren konnen die Promotoren durch Veranderung ihrer 
Sequenz in ihrer Wirksamkeit erhoht oder komplett durch wirk- 
samere Promotoren auch artfremder Organismen ausgetauscht werden. 

20 

Unter Derivaten sind auch vorteilhaft Varianten zu verstehen, 
deren Nukleotidsequenz im Bereich -1 bis -2000 vor dem Startcodon 
so verandert wurden, daS die Genexpression und/oder die Protein- 
expression verandert, bevorzugt erhoht wird. Weiterhin sind unter 
25 Derivaten auch Varianten zu verstehen, die am 3 ' -Ende verandert 
wurden . 

Die Nukleinsauresequenzen, die fur eine A6-Desaturase codiert, 
konnen synthetisch hergestellt oder natiirlich gewonnen sein oder 

30 eine Mischung aus synthetischen und naturlichen DNA-Bestandteilen 
enthalten, sowie aus verschiedenen heterologen A6-Desaturase-Gen- 
abschnitten verschiedener Organismen bestehen. Im allgemeinen 
werden synthetische Nukleotid-Sequenzen mit Codons erzeugt, die 
von den entsprechenden Wirt s organismen beispielsweise Pflanzen 

35 bevorzugt werden. Dies fuhrt in der Regel zu einer optimalen 
Expression der heterologen Gene. Diese von Pflanzen bevorzugten 
Codons konnen aus Codons mit der hochsten Proteinhauf igkeit 
bestimmt werden, die in den meisten interessanten Pflanzen- 
spezies exprimiert werden. Ein Beispiel fur Corynebacterium 

40 glutamicum ist gegeben in: Wada et al . (1992) Nucleic Acids Res. 
20:2111-2118). Die Durchfuhrung solcher Experimente sind mit 
Hilfe von Standardmethoden durchfuhrbar und sind dem Fachmann 
auf dem Gebiet bekannt. 

45 Funktionell aquivalente Sequenzen, die fur das A6-Desaturase~Gen 
codieren, sind solche Derivate der erf indungsgemaSen Sequenz, 
welche trotz abweichender Nukleotidsequenz noch die gewunschten 
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Funktionen, das heiSt die enzymatische Aktivitat der Proteine 
besitzen. Funktionelle Aquivalente umfassen somit natiirlich 
vorkoitimende Varianten der hierin beschriebenen Sequenzen sowie 
kunstliche, z.B. durch chemische Synthese erhaltene, an den 
5 Codon-Gebrauch einer Pflanze angepaEte, kiinstliche Nukleotid- 
Sequenzen. 

AuSerdem sind artifizielle DNA-Sequenzen geeignet, solange sie, 
wie oben beschrieben, die gewunschte Eigenschaft beispiels- 

10 weise der Erhohung des Gehaltes von A6-Doppelbindungen in Fett- 
sauren, Olen oder Lipiden in der Pflanze durch Uberexpression 
des A6-Desaturase-Gens in Kulturpf lanzen vermitteln. Solche 
artif iziellen DNA-Sequenzen konnen beispielsweise durch Ruck- 
iibersetzung mittels Molecular Modelling konstruierter Proteine, 

15 die A6-Desaturase-Aktivitat aufweisen oder durch in vitro- 
Selektion ermittelt werden. Mogliche Techniken zur in vitro- 
Evolution von DNA zur Veranderung bzw. Verbesserung der DNA- 
Sequenzen sind beschrieben bei Patten, P. A. et al . , Current 
Opinion in Biotechnology 8, 724-733 ( 1997) oder bei Moore, J.C. 

20 et al., Journal of Molecular Biology 272, 336-347 (1997). 
Besonders geeignet sind codierende DNA-Sequenzen, die durch 
Ruckiibersetzung einer Polypeptidsequenz gemaS der fur die 
Wirtspflanze spezifischen odon-Nutzung erhalten werden. 
Die spezifische Codon-Nutzung kann ein mit pf lanzengenetischen 

25 Methoden vertrauter Fachmann durch Computerauswertungen anderer, 
bekannter Gene der zu trans formierenden Pflanze leicht ermitteln. 

Als weitere geeignete aquivalente Nukleinsaure-Sequenzen sind 
zu nennen Sequenzen, welche fur Fusionsproteine codieren, wobei 

30 Bestandteil des Fusionsproteins ein A6-Desaturase-Polypeptid 
oder ein funktionell aquivalenter Teil davon ist. Der zweite 
Teil des Fusionsproteins kann z.B. ein weiteres Polypeptid mit 
enzymatischer Aktivitat sein oder eine antigene Polypeptidsequenz 
mit deren Hilf e ein Nachweis auf A6-Desaturase-Expression mog- 

35 lich ist (z.B. myc-tag oder his-tag) . Bevorzugt handelt es sich 
dabei jedoch urn eine regulative Proteinsequenz, wie z.B. ein 
Signalsequenz fur das ER, das das A6-Desaturase-Protein an den 
gewunschten Wirkort leitet. 

40 Vorteilhaft konnen die A6-Desaturase-Gene im erf indungsgemaSen 
Verfahren mit weiteren Genen der Fettsaurebiosynthese kombiniert 
werden. Beispiele fur derartige Gene sind die Acetyltransf erasen, 
weitere Desaturasen oder Elongasen ungesattigter oder gesattigter 
Fettsauren wie in WO 00/12720 beschrieben. Fur die in-vivo und 

45 speziell in-vitro Synthese ist die Kombination mit z.B. NADH- 
Cytochrom B5 ReduJctasen vorteilhaft, die Reduktionsaquivalente 
auf nehmen oder abgeben konnen. 
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Unter den im erf indungsgema£en Verfahren verwendeten Proteine 
sind Proteine zu verstehen, die eine in der Sequenz SEQ ID NO: 2 
dargestellte Aminosauresequenz oder eine daraus durch Substi- 
tution, Inversion, Insertion oder Deletion von einem oder 
5 mehreren Minosaureresten erhaltliche Sequenz enthalten, wobei 
die enzymatische Aktivitat des in SEQ ID NO: 2 dargestellten 
Proteins erhalten bleibt bzw. nicht wesentlich reduziert wird. 
Unter nicht wesentlich reduziert sind alle Enzyme zu verstehen, 
die noch mindestens 10 %, bevorzugt 20 %, besonders bevorzugt 

10 30 % der enzymatischen Aktivitat des Ausgangsenzyms aufweisen. 
Dabei konnen beispielsweise bestimmte Aminosauren durch solche 
mit ahnlichen physikochemischen Eigenschaf ten (Raumerf iillung, 
Basizitat, Hydrophobizitat etc.) ersetzt werden. Beispielsweise 
werden Argininreste gegen Lysinreste, Valinreste gegen Isoleucin- 

15 reste oder Asparaginsaurereste gegen Glutaminsaurereste aus- 
getauscht . Es konnen aber auch ein oder mehrere Aminosauren in 
ihrer Reihenfolge vertauscht, hinzugefugt oder entfernt werden, 
oder es konnen mehrere dieser Mafinahmen miteinander kombiniert 
werden . 

20 

Unter Derivaten sind auch funktionelle Aquivalente zu verstehen, 
die insbesondere auch natiirliche oder kunstliche Mutationen einer 
urspriinglich isolierten fur A6-Desaturase codierende Sequenz 
beinhalten, welche weiterhin die gewunschte Funktion zeigen, das 

25 hei£t das deren enzymatische Aktivitat nicht wesentlich reduziert 
ist. Mutationen umfassen Substitutionen, Additionen, Deletionen, 
Vertauschungen oder Insertionen eines oder mehrerer Nukleotid- 
reste. Somit werden beispielsweise auch solche Nukleotidsequenzen 
durch die vorliegende Erfindung mit umfafit, welche man durch 

30 Modifikation der A6-Desaturase Nukleotidsequenz erhalt. Ziel 
einer solchen Modifikation kann z.B. die weitere Eingrenzung 
der darin enthaltenen codierenden Sequenz oder z.B. auch die 
Einfugung weiterer Restriktionsenzym-Schnittstellen sein. 

35 Funktionelle Aquivalente sind auch solche Varianten, deren 
Funktion, verglichen mit dem Ausgangsgen bzw. Genf ragment , 
abgeschwacht (= nicht wesentlich reduziert) oder verstarkt ist 
(= Enzymaktivitat ist starker als die Aktivitat des Ausgangs- 
enzym, das heiSt Aktivitat ist hoher als 100 %, bevorzugt hoher 

40 als 110 %, besonders bevorzugt hoher als 130 %) . 

Die im erf indungsgemaSen Verfahren verwendeten oben genannten 
Nukleinsauresequenzen werden vorteilhaf t zxim Einbringen in 
einen Wirtsorganismus in eine Expressionskassette inseriert . 
45 Die Nukleinsauresequenzen konnen jedoch auch direkt in den Wirts- 
organismus eingebracht werden. Die Nukleinsauresequenz kann dabei 
vorteilhaf t beispielsweise eine DNA- oder cDNA-Sequenz sein. 



• 
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Zur Insertion in eine Expressionskassette geeignete codierende 
Sequenzen sind beispielsweise solche, die fur eine A6-Desaturase 
mit den oben beschriebenen Sequenzen codieren und die dem Wirt 
die Fahigkeit zur Uberproduktion von Fettsauren, Olen oder 
5 Lipiden mit Doppelbindungen in A6-Position verleihen. Diese 
Sequenzen konnen homologen oder heterologen Ursprungs sein. 

Unter einer Expressionskassette (= Nukleinsaurekons trukt oder 
-fragment) ist die in SEQ ID NO: 1 genannte Sequenz , die sich 

10 als Ergebnis des genetischen Codes und/oder deren f unktionellen 
oder nicht f unktionellen Derivate zu verstehen, die mit einem 
oder mehreren Regulationssignalen vorteilhaf terweise zur Erhohung 
der Genexpression funktionell verkniipft wurden und welche die 
Expression der codierenden Sequenz in der Wirtszelle steuem, 

15 Diese regulatorischen Sequenzen sollen die gezielte Expression 
der Gene und der Proteinexpression ermoglichen. Dies karm bei- 
spielsweise je nach Wirtsorganismus bedeuten, daS das Gen erst 
nach Induktion exprimiert und/oder uberexprimiert wird, oder da£ 
es sofort exprimiert und/oder uberexprimiert wird. Beispielsweise 

20 handelt es sich bei diesen regulatorischen Sequenzen um Sequenzen 
an die Induktoren oder Repressoren binden und so die Expression 
der Nukleinsaure regulieren. Zusatzlich zu diesen neuen Regu- 
lationssequenzen oder anstelle dieser Sequenzen kann die natiir- 
liche Regulation dieser Sequenzen vor den eigentlichen Struktur- 

25 genen noch vorhanden sein und gegebenenf alls genetisch verandert 
worden sein, so daS die natiirliche Regulation ausgeschaltet und 
die Expression der Gene erhoht wurde. Das Genkonstrukt kann aber 
auch einfacher aufgebaut sein, das heiEt es wurden keine zusatz- 
lichen Regulationssignale vor die Nukleinsauresequenz oder dessen 

30 Derivate inseriert und der natiirliche Promotor mit seiner Regu- 
lation wurde nicht entfernt. Stattdessen wurde die natiirliche 
Regulationssequenz so mutiert, date keine Regulation mehr erfolgt 
und/oder die Genexpression gesteigert wird. Diese veranderten 
Promotoren konnen in Form von Teilsequenzen (= Promotor mit 

35 Teilen der erf indungsgemaSen Nukleinsauresequenzen) auch allein 
vor das natiirliche Gen zur Steigerung der Aktivitat gebracht 
werden. Das Genkonstrukt kann auSerdem vorteilhaf terweise auch 
eine oder mehrere sogenannte w enhancer Sequenzen" funktionell 
verkniipft mit dem Promotor enthalten, die eine erhohte Expression 

40 der Nukleinsauresequenz ermoglichen. Auch am 3 ' -Ende der DNA- 
Sequenzen konnen zusatzliche vorteilhafte Sequenzen inseriert 
werden wie weitere regulatorische Elemente oder Terminatoren. 
Das A6-Desaturase-Gen kann in einer oder mehreren Kopien in 
der Expressionskassette (= Genkonstrukt) enthalten sein. Auch 

45 eventuell mit exprimierte Gene, die vorteilhaf t an der Fettsaure- 
biosynthese beteiligt sind, konnen in einer oder mehreren Kopien 
in der Expressionskassette vorhanden sein. 
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Die regulatorischen Sequenzen bzw. Faktoren konnen dabei wie 
oben beschrieben vorzugsweise die Genexpression der eingef iihrten 
Gene positiv beeinflussen und dadurch erhdhen. So kann eine Ver- 
starkung der regulatorischen Elemente vorteilhaf terweise auf der 
5 Transkriptionsebene erfolgen, indem starke Transkriptionssignale 
wie Promotoren und/oder "Enhancer" verwendet werden. Daneben ist 
aber auch eine Verstarkung der Translation moglich, indem bei- 
spielsweise die Stabilitat der mRNA verbessert wird. 

10 Als Promotoren in der Expressionskassette sind grundsatzlich 
alle Promotoren geeignet, die die Expression von Fremdgenen in 
Organismen vorteilhaf t in Pflanzen oder Pilzen steuern konnen. 
Vorzugsweise verwendet man insbesondere einen pflanzlichen 
Promotor oder Promotoren, die beispielsweise aus einem Pflanzen- 

15 virus entstammen. Vorteilhaf te Regulationssequenzen fur das 

erf indungsgemaSe Verfahren sind beispielsweise in Promotoren wie 
cos-, tac-, trp-, tet-, trp-tet-, lpp-/ lac-, lpp-lac-, lad<J-/ 
T7-, T5-, T3-, gal-, trc-, ara-, SP6-, oder im ^-P L - Promotor 

enthalten, die vorteilhaf terweise in gram-negativen Bakterien 

20 Anwendung finden. Weitere vorteilhaf te Regulationssequenzen sind 
beispielsweise in den gram-positiven Promotoren amy und SP02 , in 
den Hefe- oder Pilzpromotoren ADC1, MFa, AC, P-60, CYC1, GAPDH, 
TEF, rp28, ADH oder in den Pflanzenpromotoren wie CaMV/35S 
[Franck et al . , Cell 21(1980) 285-294], RUB1SC0 SSU, OCS, B33, 

25 nos (= Nopalin Synthase Promotor) oder im Ubi qui tin- Promo tor 
enthalten. Die Expressionskassette kann auch einen chemisch 
induzierbaren Promotor enthalten, durch den die Expression des 
exogenen A6-Desaturase-Gens in den Organismen vorteilhaft in den 
Pflanzen zu einem bestimmten Zeitpunkt gesteuert werden kann. 

30 Derartige vorteilhafte Pflanzenpromotoren sind beispielsweise 
der PRPl-Promotor [Ward et al . , Plant. Mol . Biol. 22 (1993), 
361-366], ein durch Benzensulf onamid-induzierbarer (EP 388186), 
ein durch Tetrazyklin-induzierbarer (Gatz et al . , (1992) Plant 
J. 2,397-404), ein durch Salizylsaure induzierbarer Promotor 

35 (WO 95/19443), ein durch Abscisinsaure-induzierbarer (EP335528) 
bzw. ein durch Ethanol- oder Cyclohexanon-induzierbarer 
(WO 93/21334) Promotor. Weitere Pflanzenpromotoren sind bei- 
spielsweise der Promotor der cytosolischen FBPase aus Kartoffel, 
der ST-LSI Promotor aus Kartoffel (Stockhaus et al . , EMBO J. 

40 8 (1989) 2445-245) , der Promotor der Phosphoribosylpyrophosphat 
Amidotransf erase aus Glycine max (siehe auch Genbank Accession 
Nummer U87999) oder ein Nodien-spezif ischen Promotor wie in 
EP 249676 konnen vorteilhaft verwandt werden. Vorteilhaft sind 
insbesondere solche pflanzliche Promotoren, die die Expression in 

45 Geweben oder Pf lanzenteilen/-organen sicherstellen, in denen die 
Fettsaurebiosynthese bzw. dessen Vorstufen stattfindet wie bei- 
spielsweise im Endosperm oder im sich entwickelnden Embryo. Ins- 
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besondere zu nennen sind vorteilhafte Promotoren, die eine samen- 
spezifische Expression gewahrleisten wie beispielsweise der USP- 
Promotor oder Derivate davon, der LEB4-Promotor , der Phaseolin- 
Promotor oder der Napin-Promotor . Der erf indungsgemaS aufgefuhrte 
5 und besonders vorteilhafte USP-Promotor oder dessen Derivate ver- 
mitteln in der Samenentwicklung eine sehr fruh Genexpression 
(Baeumlein et al . , Mol Gen Genet, 1991, 225 (3): 459-67). Weitere 
vorteilhafte samenspezif ische Promotoren, die fur monokotyle und 
dikotyle Pflanzen verwendet werden konnen, sind die fur Dikotyle 

10 geeignete Promotoren wie ebenfalls beispielhaf t ausgefiihrte 

Napingen-Promotor aus Raps (US5 , 608 , 152 ) , der Oleosin-Promotor 
aus Arabidopsis (W098/45461) , der Phaseolin-Promotor aus 
Phaseolus vulgaris (1155,504,200), der Bce4-Promotor aus Brassica 
(WO91/13980) oder der Leguminosen B4-Promotor (LeB4, Baeumlein 

15 et al., Plant J . , 2, 2, 1992: 233 - 239) oder fur Monokotyle 

geeignete Promotoren wie die Promotoren die Promotoren des lpt2- 
oder lptl-Gens aus Gerste (W095/15389 und WO95/23230) oder die 
Promotoren des Gersten Hordein-Gens , des Reis Glutelin-Gens , des 
Reis Oryzin-Gens, des Reis Prolamin-Gens , des Weizen Gliadin- 

20 Gens, des Weizen Glutelin-Gens, des Mais Zein-Gens, des Hafer 

Glutelin-Gens, des Sorghum Kasirin-Gens oder des Roggen Secalin- 
Gens, die in WO99/16890 beschrieben werden. 

Weiterhin sind insbesondere solche Promotoren bevorzugt, die 
25 die Expression in Geweben oder Pf lanzenteilen sicherstellen, in 
denen beispielsweise die Biosynthese von Fettsauren, Olen und 
Lipiden bzw. deren Vorstufen stattfindet. Insbesondere zu nennen 
sind Promotoren, die eine samenspezif ische Expression gewahr- 
leisten. Zu nennen sind der Promotor des Napin-Gens aus Raps 
30 (US 5,608,152), des USP-Promotor aus Vicia faba (USP=unbekanntes 
Samenprotein, Baeumlein et al . , Mol Gen Genet, 1991, 225 (3): 
459-67), des Oleosin-Gens aus Arabidopsis (W098/45461 ) , des 
Phaseolin-Promotors (US 5,504,200) oder der Promotor des Legumin 
B4-Gens (LeB4; Baeumlein et al . , 1992, Plant Journal, 2 (2): 
35 233-9) . Weiterhin sind zu nennen Promotoren, wie der des lpt2 
oder lptl-Gens aus Gerste (W095/15389 und WO95/23230) , die in 
monokotylen Pflanzen samenspezif ische Expression vermitteln. 

In der Expressionskassette {= Genkonstrukt , Nukleinsaurekon- 
40 strukt) konnen wie oben beschrieben noch weitere Gene, die in 
die Organismen eingebracht werden sollen, enthalten sein. Diese 
Gene konnen unter getrennter Regulation oder unter der gleichen 
Regulationsregion wie das A6-Desaturase-Gen liegen. Bei diesen 
Genen handelt es sich beispielsweise urn weitere Biosynthesegene 
45 vorteilhaft der Fettsaurebiosynthese, die eine gesteigerte 

Synthese ermoglichen. Beispielsweise seien die Gene fur die A15-, 
A12-, A9-, A5-, A4-Desaturase, die verschiedenen Hydroxylases 
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die Acyl-ACF-Thioesterasen, (3-Ketoacyl-Synthasen oder (3-Ketoacyl- 
Reductasen genannt . Vorteilhaft werden die Desaturasegene im 
Nukleinsaurekonstrukt verwendet . 

5 Prinzipiell konnen alle natiirlichen Promotoren mit ihren Regu- 
lationsseguenzen wie die oben genannten fur die erf indungsgemaSe 
Expressionskassette und das erf indungsgema£e Verfahren, wie 
unten beschrieben, verwendet werden. Dariiberhinaus konnen auch 
synthetische Promotoren vorteilhaft verwendet werden. 

10 

Es konnen verschiedene DNA-Fragmente manipuliert werden, um 
eine Nukleotid-Sequenz zu erhalten, die zweckma£igerweise in 
der korrekten Richtung gelesen wird und die mit einem korrekten 
Leseraster ausgestattet ist. Fur die Verbindung der DNA-Fragmente 
15 (= erf indungsgemafee Nukleinsauren) miteinander konnen an die 
Fragmente Adaptoren oder Linker angesetzt werden. 

ZweckmaSigerweise konnen die Promotor- und die Terminator- 
Regionen in Transkriptionsrichtung mit einem Linker oder Poly- 

20 linker, der eine oder mehrere Restriktionsstellen fur die 

Insertion dieser Sequenz enthalt, versehen werden. In der Regel 
hat der Linker 1 bis 10, meistens 1 bis 8, vorzugsweise 2 bis 
6 Restriktionsstellen. Im allgemeinen hat der Linker innerhalb 
der regulatorischen Bereiche eine GroSe von weniger als 100 bp, 

25 haufig weniger als 60 bp, mindestens jedoch 5 bp. Der Promotor 
kann sowohl nativ bzw. homolog als auch fremdartig bzw. hetero- 
log zum Wirtsorganismus beispielsweise zur Wirtspflanze sein. 
Die Expressionskassette beinhaltet in der 5 ' -3 ' -Transkriptions- 
richtung den Promotor, eine DNA-Sequenz, die fiir ein im er- 

30 f indungsgemaSen Verfahren verwendetes A6-Desaturase-Gen codiert 
und eine Region fur die transkriptionale Termination. Ver- 
schiedene Terminationsbereiche sind gegeneinander beliebig 
austauschbar . 

35 Ferner konnen Manipulationen, die passende Restriktionsschnitt- 
stellen bereitstellen oder die uberf liissige DNA oder Restrik- 
tionsschnittstellen entfernen r eingesetzt werden. Wo Insertionen, 
Deletionen oder Subs ti tut ionen wie z.B. Transitionen xind Trans- 
versionen in Frage kommen, konnen in vi tr*o-Mutagenese, -primer- 

40 repair-, Restriktion oder Ligation verwendet werden. Bei geeig- 
neten Manipulationen, wie z.B. Restriktion, -chewing-back- oder 
Auffullen von Uberhangen fiir -bluntends-, konnen komplementare 
Enden der Fragmente fur die Ligation zur Verfugung gestellt 
werden . 
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Von Bedeutung fur eine vorteilhafte hohe Expression kann u.a. 
das Anhangen des spezifischen ER-Re tent ions signals SEKDEL sein 
(Schouten, A. et al . , Plant Mbl . Biol. 30 (1996), 781-792), die 
durchschnittliche Express ionshohe wird damit verdreifacht bis 
5 vervierfacht . Es konnen auch andere Retentionssignale, die natiir- 
licherweise bei im ER lokalisierten pflanzlichen und tierischen 
Proteinen vorkommen, fur den Aufbau der Kassette eingesetzt 
werden . 

10 Bevorzugte Polyadenyiierungssignale sind pflanzliche Poly- 

adenylierungssignale, vorzugsweise solche, die im wesentlichen 
T-DNA-Polyadenylierungssignale aus Agrobacterium tumefaciens, 
insbesondere des Gens 3 der T-DNA (Octopin Synthase) des 
Ti-Plasmids pTiACHS entsprechen (Gielen et al . , EMBO J. 3 (1984), 

15 835 ff) oder entsprechende funktionelle Aquivalente. 

Die Herstellung einer Expressionskassette erfolgt durch Fusion 
eines geeigneten Promotors mit einer geeigneten A6-Desaturase- 
DNA-Sequenz sowie einem Polyadenylierungssignal nach gangigen 

20 Rekombinations- und Klonierungstechniken, wie sie beispielsweise 
in T. Maniatis, E.F . Fritsch und J. Sambrook, Molecular Cloning: 
A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring 
Harbor, NY (1989) sowie in T.J. Silhavy, M.L. Berman und L.W. 
Enquist, Experiments with Gene Fusions, Cold Spring Harbor 

25 Laboratory, Cold Spring Harbor, NY (1984) und in Ausubel, F.M. 

et al., Current Protocols in Molecular Biology, Greene Publishing 
Assoc. and Wiley- Interscience (1987) beschrieben werden. 

Die DNA Sequenz codierend fiir eine A6-Desaturase aus Phsyco- 
30 mitrella patens beinhaltet alle Sequenzmerkmale, die notwendig 
sind, um eine dem Ort der Fettsaure-, Lipid- oder Olbiosynthese 
korrekte Lokalisation zu erreichen. Daher sind keine weiteren 
Targetingsequenzen per se notwendig. Allerdings kann eine solche 
Lokalisation wunschenswert und vorteilhaft sein und daher kunst- 
35 lich verandert oder verstarkt werden, sodaS auch solche Fusions- 
konstrukte eine bevorzugte vorteilhafte Aus fuhrungs form der 
Erfindung sind. 

Insbesondere bevorzugt sind Sequenzen, die ein Targeting in 
40 Plastiden gewahrleisten. Unter bestimmten Umstanden kann auch 
ein Targeting in andere Kompartimente (referiert: Kermode, Crit. 
Rev. Plant Sci . 15, 4 (1996), 285-423) z.B. in in die Vakuole, 
in das Mitochondrium, in das Endoplasiuatische Retikulum (ER) , 
Peroxisomen, Lipidkorper oder durch ein Fehlen entsprechender 
45 operativer Sequenzen ein Verbleib im Kompartiment des Entstehens, 
dem Zytosol, wunschenswert sein. 
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Vorteilhaf terweise werden die fur A6-Desaturase-Gene codierenden 
Nukleinsauresequenzen zusammen mit mindestens einem Reportergen 
in eine Expressionskassette kloniert, die in den Organismus uber 
einen Vektor oder direkt in das Genom eingebracht wird. Dieses 
5 Reportergen sollte eine leichte Detektierbarkeit uber einen 

Wachstums-, Fluoreszenz- , Chemo-, Biolumineszenz- oder Resistenz- 
assay oder uber eine photometrische Messung ermoglichen. Bei- 
spielhaft seien als Reportergene Antibiotika-oder Herbizid- 
resis tenzgene , Hydrolasegene , Fluoreszenzproteingene , Biolumin- 

10 eszenzgene, Zucker- oder Nukleotidstof fwechselgene oder Bio- 
synthesegene wie das Ura3-Gen, das Ilv2-Gen, das Luciferasegen, 
das (5-Galactosidasegen, das gfp-Gen, das 2-Desoxyglucose-6- 
phosphat-Phosphatasegen, das P-Glucuronidase-Gen, |3-Lactamasegen, 
das Neomyc inphospho trans f eras egen, das Hygromyc inphospho trans - 

15 ferasegen oder das BASTA (= Gluphosinatresistenz ) -Gen genannt. 
Diese Gene ermoglichen eine leichte MeSbarkeit und Quantif izier- 
barkeit der Transkriptionsaktivitat und damit der Expression der 
Gene. Damit lassen sich Genomstellen identif izieren, die eine 
unterschiedliche Produktivitat zeigen. 

20 

GemaS einer bevorzugten Ausf uhrungsf orm umfafet eine Expressions- 
kassette stromauf warts, d.h. am 5 ' -Ende der codierenden Sequenz , 
einen Promotor und stromabwarts , d.h. am 3'-Ende, ein Poly- 
adenylierungssignal und gegebenenf alls weitere regulatorische 

25 Elemente, welche mit der dazwischenliegenden codierenden Sequenz 
fur die A6-Desaturase DNA Sequenz operativ verknupft sind. Unter 
einer operativen Verkniipfung versteht man die sequenzielle 
Anordnung von Promotor, codierender Sequenz, Terminator und ggf . 
weiterer regulativer Elemente derart, daS jedes der regulativen 

30 Elemente seine Funktion bei der Expression der codierenden 
Sequenz bestimmungsgemaS erfiillen kann. Die zur operativen 
Verkniipfung bevorzugten Sequenzen sind Targeting-Sequenzen zur 
Gewahrleistung der subzellularen Lokalisation in Plastiden. Aber 
auch Targeting-Sequenzen zur Gewahrleistung der subzellularen 

35 Lokalisation im Mitochondrium, im Endoplasmatischen Retikulum 

(= ER) , im Zellkern, in Olkorperchen oder anderen Kompartimenten 
sind bei Bedarf einsetzbar sowie Trans lat ions vers t&rker wie die 
5 ' -Fiihrungssequenz aus dem Tabak-Mosaik-Virus (Gallie et al . , 
Nucl. Acids Res. 15 (1987), 8693-8711). 

40 

Eine Expressionskassette kann beispielsweise einen konstitutiven 
Promotor (bevorzugt den USP- oder Napin-Promotor) , das zu 
exprimierende Gen und das ER-Retentionssignal enthalten. Als 
ER-Retentionssignal wird bevorzugt die Aminosauresequenz KDEL 
45 (Lysin, Asparaginsaure, Glutaminsaure , Leucin) verwendet. 
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Die Expressionskassette wird zur Expression in einem pro- 
karyontischen oder eukaryontischen Wirtsorganismus beispielsweise 
einem Mikroorganismus wie einem Pilz oder einer Pflanze vorteil- 
hafterweise in einen Vektor wie beispielsweise einem Plasmid, 
5 einem Phagen oder sonstiger DNA inseriert, der eine optimale 
Expression der Gene im Wirtsorganismus ermoglicht. Geeignete 
Plasmide sind beispielsweise in E. coli pLG338, pACYC184, pBR- 
Serie wie z.B. pBR322, pUC-Serie wie pUC18 oder pUC19, M113mp- 
Serie, pKC30, pRep4 , pHSl, pHS2 , P PLc236, pMBL24, pLG200, pUR290, 

10 pIN-III 113 -Bl, JLgtll oder pBdCI, in Streptomyces pIJlOl, pIJ364, 
pIJ702 oder pIJ361, in Bacillus pUBllO, pC194 oder pBD214, in 
Corynebacterium pSA77 oder pAJ667, in Pilzen pALSl, pIL2 oder 
pBB116, weitere vorteilhafte Pilzvektoren werden von Romanos, 
M. A. et al., [(1992) "Foreign gene expression in yeast: a 

15 review", Yeast 8: 423-488] und von van den Hondel, C. A. M.J. J. et 
al . [(1991) "Heterologous gene expression in filamentous fungi] 
sowie in More Gene Manipulations in Fungi [J.W. Bennet Sc L.L. 
Lasure, eds . , p. 396-428: Academic Press: San Diego] und in "Gene 
transfer systems and vector development for filamentous fungi * 

20 [van den Hondel, C. A.M. J. J. & Punt, P.J. (1991) in: Applied 

Molecular Genetics of Fungi, Peberdy, J.F. et al . , eds., p. 1-28, 
Cambridge University Press: Cambridge] beschrieben. Vorteilhafte 
Hefevektoren sind beispielsweise 2jlM, pAG-1, YEp6, YEpl3 oder 
pEMBLYe23 . Beispiele fur Algen- oder Pf lanzenpromotoren sind 

25 pLGV23, pGHlac + , pBIN!9 , pAK2004, pVKH oder pDH51 (siehe Schmidt, 
R. and Willmitzer, L. , 1988). Die oben genannten Vektoren oder 
Derivate der vorstehend genannten Vektoren stellen eine kleine 
Auswahl der moglichen Plasmide dar. Weitere Plasmide sind dem 
Fachmann wohl bekannt und konnen beispielsweise aus dem Buch 

30 Cloning Vectors (Eds. Pouwels P.H. et al . Elsevier, Amsterdam-New 
York-Oxford, 1985, ISBN 0 444 904018) entnommen werden. Geeignete 
pflanzliche Vektoren werden unter anderem in "Methods in Plant 
Molecular Biology and Biotechnology" (CRC Press), Kap. 6/7, 
S. 71-119 beschrieben. Vorteilhafte Vektoren sind sog. shuttle- 

35 Vektoren oder binare Vektoren, die in E. coli und Agrobacterium 
replizieren. 

Unter Vektoren sind auEer Plasmiden auch alle anderen dem 
Fachmann bekannten Vektoren wie beispielsweise Phagen, Viren 
40 wie SV40, CMV, Baculovirus , Adenovirus, Transposons, IS-Elemente, 
Phasmide, Phagemide, Cosmide, lineare oder zirkulare DNA zu 
verstehen. Diese Vektoren konnen autonom im Wirtsorganismus 
repliziert oder chromosomal repliziert werden, bevorzugt ist 
eine chromosomale Replikation. 
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In einer weiteren Ausgestaltungsf orm des Vektors kann die 
erf indungsgema£e Expressionskassette auch vorteilhaf terweise in 
Form einer linearen DNA in die Organismen eingefuhrt werden und 
iiber heterologe oder homologe Rekombination in das Genom des 
5 Wirtsorganismus integriert werden. Diese lineare DNA kann aus 

einem linearisierten Plasmid oder nur aus der Expressionskassette 
als Vektor oder den erf indungsgemaSen Nukleinsauresequenzen 
bestehen. 



10 In einer weiteren vorteilhaf ten Aus fuhrungs form kann die 
erf indungsgematee Nukleinsaureseguenz auch alleine in einen 
Organismus eingebracht werden. 

Sollen neben der erf indungsgemaEen Nukleinsaureseguenz weitere 
15 Gene in den Organismus eingefuhrt werden, so konnen alle zusammen 
mit einem Report ergen in einem einzigen Vektor oder jedes ein- 
zelne Gen mit einem Reportergen in je einem Vektor oder mehrere 
Gene zusammen in verschiedenen Vektoren in den Organismus ein- 
gebracht werden, wobei die verschiedenen Vektoren gleichzeitig 
20 oder sukzessive eingebracht werden konnen. 

Der Vektor enthalt vorteilhaft mindestens eine Kopie der Nuklein- 
sauresequenzen, die fur eine A6-Desaturase codieren, und/ oder der 
Expressionskassette . 

25 

Beispielhaft kann die pflanzliche Expressionskassette in den 
Transformationsvektor pRT ((a) Toepfer et al . , 1993, Methods 
Enzymol., 217: 66-78; (b) Toepfer et al. 1987, Nucl. Acids. 
Res. 15: 5890 ff.) eingebaut werden. 

30 

Alternativ kann ein rekombinanter Vektor ( = Expressionsvektor ) 
auch in-vitro transkribiert und translatiert werden, z.B. durch 
Nutzung des T7 Promotors und der T7 RNA Polymerase. 

35 In Prokaryoten verwendete Expressionsvektoren nutzen haufig 

induzierbare Systeme mit und ohne Fusionsproteinen bzw Fusions- 
oligopeptiden, wobei diese Fusionen sowohl N- terminal als auch 
C-terminal oder anderen nutzbaren Domanen eines Proteins erfolgen 
konnen. Solche Fusionsvektoren dienen in der Regel dazu: i.) die 

40 Expressionsrate der RNA zu erhohen ii.) die erzielbare Protein- 
syntheserate zu erhohen, iii.) die Loslichkeit des Proteins zu 
erhohen, iv. ) oder die Reinigung durch einen fur die Af finitats- 
chromatographie nutzbare Bindesequenz zu vereinf achen. Haufig 
werden auch proteolytische Spaltstellen uber Fusionsproteine 

45 eingeftihrt, was die Abspaltung eines Teils des Fusionsproteins 
auch der Reinigung ermoglicht. Solche Erkennungssequenzen fur 
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Proteasen erkennen sind z.B. Faktor Xa, Thrombin und Entero- 
kinase . 

Typische vorteilhafte Fusions- und Express ionsvektoren sind pGEX 
5 [Pharmacia Biotech Inc; Smith, D.B. and Johnson, K.S. (1988) Gene 
67: 31-40], pMAL (New England Biolabs, Beverly, MA) and pRIT5 
(Pharmacia, Piscataway, NJ) welches Glutathion S-transf erase 
beinhaltet (GST) , Maltose Bindeprotein, oder Protein A. 

10 Weitere Beispiele fur E. coli Expressionsvektoren sind pTrc 

[Amann et al . , (1988) Gene 69:301-315] und pET Vektoren [Studier 
et al., Gene Expression Technology: Methods in Enzymology 185, 
Academic Press, San Diego, California (1990) 60-89; Stratagene, 
Amsterdam, Niederlande] . 

15 

Weitere vorteilhafte Vektoren zur Verwendung in Hefe sind 
pYepSecl (Baldari, et al . , (1987) Embo J. 6:229-234), pMFa 
(Kurjan and Herskowitz, (1982) Cell 30 : 933-943 ) , pJRY88 (Schultz 
et al., (1987) Gene 54:113-123), and pYES-Derivate (Invitrogen 

20 Corporation, San Diego, CA) . Vektoren fiir die Nutzung in fila- 

mentosen Pilzen sind beschrieben in: van den Hondel, C .A.M. J. J. & 
Punt, P.J. (1991) "Gene transfer systems and vector development 
for filamentous fungi, in: Applied Molecular Genetics of Fungi, 
J.F. Peberdy, et al . , eds . , p. 1-28, Cambridge University Press: 

25 Cambridge. 

Alternativ konnen auch vorteilhaft Insektenzellexpressionsvekto- 
ren genutzt werden z.B. fiir die Expression in Sf 9 Zellen. Dies 
sind z.B. die Vektoren der pAc Serie (Smith et al. (1983) Mol . 
30 Cell Biol. 3:2156-2165) und der pVL series (Lucklow and Summers 
(1989) Virology 170:31-39) . 

Des weiteren konnen zur Genexpression vorteilhaft Pf lanzen- 
zellen oder Algenzellen genutzt werden. Beispiele fiir Pflanzen- 
35 expressionsvektoren finden sich in Becker, D., et al . (1992) 
"New plant binary vectors with selectable markers located 
proximal to the left border", Plant Mol. Biol. 20: 1195-1197 
oder in Bevan, M.W. (1984) "Binary AgroJbactejrium vectors for 
plant transformation", Nucl. Acid. Res. 12: 8711-8721. 

40 

Weiterhin konnen die fiir die A6-Desaturase codierenden Nuklein- 
sauresequenzen in Saugerzellen exprimiert werden. Beispiel 
fiir entsprechende Expressionsvektoren sind pCDM8 und pMT2PC 
genannt in: Seed, B. (1987) Nature 329:840 oder Kaufman et al. 
45 (1987) EMBO J. 6: 187-195). Dabei sind vorzugsweise zu nutzende 
Promotoren viralen Ursprungs wie z.B. Promotoren des Polyoma, 
Adenovirus 2, Cytomegalovirus oder Simian Virus 40. Weitere 
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prokaryotische und eukaryotische Expressionssysteme sind genannt 
in Kapitel 16 und 17 in Sambrook et al . , Molecular Cloning: 
A Laboratory Manual. 2nd, ed. , Cold Spring Harbor Laboratory, 
Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 
5 1989. 



Das Einbringen der erf indungsgemaSen Nukleinsauren, der 
Expressionskassette oder des Vektors in Organismen beispiels- 
weise in Pflanzen kann prinzipiell nach alien dem Fachmann 
10 bekannten Methoden erfolgen. 

Fur Mikroorganismen kann der Fachmann entsprechende Methoden den 
Lehrbuchern von Sambrook, J. et al . (1989) Molecular cloning: 
A laboratory manual, Cold Spring Harbor Laboratory Press, von 

15 F.M. Ausubel et al . (1994) Current protocols in molecular bio- 
logy, John Wiley and Sons, von D.M. Glover et al., DNA Cloning 
Vol.1, (1995), IRL Press (ISBN 019-963476-9), von Kaiser et al . 
(1994) Methods in Yeast Genetics, Cold Spring Habor Laboratory 
Press oder Guthrie et al . Guide to Yeast Genetics and Molecular 

20 Biology, Methods in Enzymology, 1994, Academic Press entnehmen. 

Die Ubertragung von Fremdgenen in das Genom einer Pflanze wird 
als Transformation bezeichnet. Es werden dabei die beschriebenen 
Methoden zur Transformation und Regeneration von Pflanzen aus 

25 Pf lanzengeweben oder Pf lanzenzellen zur transienten oder stabilen 
Transformation genutzt. Geeignete Methoden sind die Protoplasten- 
transf ormation durch Polyethylenglykol-induzierte DNA-Aufnahme, 
das biolistische Verfahren mit der Genkanone - die sogenannte 
particle bombardment Methode die Elektroporation, die Inku- 

30 bation trockener Embryonen in DNA-hal tiger Losung, die Mikro- 
injektion und der durch Agrobacterium vermittelte Gentransfer. 
Die genannten Verfahren sind beispielsweise in B. Jenes et al . , 
Techniques for Gene Transfer, in: Transgenic Plants, Vol. 1, 
Engineering and Utilization, herausgegeben von S.D. Kung und R. 

35 Wu, Academic Press (1993) 128-143 sowie in Potrykus Annu. Rev. 
Plant Physiol. Plant Molec . Biol. 42 (1991) 205-225) beschrieben. 
Vorzugsweise wird das zu exprimierende Konstrukt in einen Vektor 
kloniert, der geeignet ist, Agrobacterium tumefaciens zu trans- 
formieren, beispielsweise pBinl9 (Bevan et al., Nucl. Acids Res. 

40 12 (1984) 8711) . Mit einem solchen Vektor trans formierte Agro- 
bakterien konnen dann in bekannter Weise zur Transformation von 
Pflanzen, insbesondere von Kulturpf lanzen, wie z.B. von Tabak- 
pflanzen, verwendet werden, indem beispielsweise verwundete 
Blatter oder Blattstiicke in einer Agrobakterienlosung gebadet 

45 und anschlie£end in geeigneten Medien kultiviert werden. Die 
Transformation von Pflanzen mit Agrobacterium tumefaciens wird 
beispielsweise von Hofgen und Willmitzer in Nucl. Acid Res. 




PCT7EPOO/06223 



WO 01/02591 



22 



(1988) 16, 9877 beschrieben oder ist unter anderem bekannt aus 
F.F. White, Vectors for Gene Transfer in Higher Plants; in Trans- 
genic Plants, Vol. 1, Engineering and Utilization, herausgegeben 
von S.D. Kung und R. Wu, Academic Press, 19 93, S. 15-3 8. 

5 

Mit einem wie oben beschriebenen Expressionsvektor transf ormierte 
Agrobakterien konnen ebenfalls in bekannter Weise zur Trans- 
formation von Pflanzen wie Testpflanzen wie Arabidopsis oder 
Kulturpf lanzen wie Getreide, Mais, Hafer, Roggen, Gerste, Weizen, 

10 Soja, Reis, Baumwolle, Zuckerrube, Canola, Triticale, Reis, 
Sonnenblume , Flachs , Hanf, Kartoffel, Tabak, Tomate, Kaffee, 
Kakao, Tee, Karotte, Paprika, Raps, Tapioka, Maniok, Pfeilwurz, 
Tagetes, Alfalfa, Salat und den verschiedenen Baum-, Nu£- 
und Weinspezies, insbesondere von 01-haltigen Kulturpf lanzen, 

15 wie Soja, Erdnu&, Rizinus, Borretsch, Lein, Sonnenblume, 

Canola, Baumwolle, Flachs, Raps, KokosnuS, Olpalme, Farbersaflor 
(Carthamus tinctorius) oder Kakaobohne verwendet werden, z.B. 
indem verwundete Blatter oder Blattstucke in einer Agrobakterien- 
losung gebadet und anschliefiend in geeigneten Medien kultiviert 

20 werden. 

Die genetisch veranderten Pf lanzenzellen konnen iiber alle dem 
Fachmann bekannten Methoden regeneriert werden. Entsprechende 
Methoden k6nnen den oben genannten Schriften von S.D. Kung und 
25 R. Wu, Potrykus oder Hofgen und Willmitzer entnoitimen werden. 

Als Organismen bzw. Wirtsorganismen fur die erf indungsgemaSen 
Verfahren verwendeten Nukleinsauren, die verwendete Expressions- 
kassette oder den verwendeten Vektor eignen sich prinzipiell 

30 vorteilhaft alle Organismen, die in der Lage sind Fettsauren 
speziell ungesattigte Fettsauren zu synthetisieren bzw. fur die 
Expression rekombinanter Gene geeignet sind. Beispielhaft seien 
Pflanzen wie Arabidopsis, Asteraceae wie Calendula oder Kultur- 
pf lanzen wie Soja, ErdnuS, Rizinus, Sonnenblume, Mais, Baum- 

35 wolle, Flachs, Raps, Kokosnufi, Olpalme, Farbersaflor (Carthamus 
tinctorius) oder Kakaobohne, Mikroorganismen wie Pilze bei- 
spielsweise die Gattung Mortierella, Saprolegnia oder Pythium, 
Bakterien wie die Gattung Escherichia, Cyanobakterien, Ciliaten, 
Thrausto- oder Schizichytrien, Algen oder Protozoen wie Dino- 

40 f lagellaten wie Crypthecodinium genannt . Bevorzugt werden 
Organismen, die naturlicherweise Ole in groSeren Mengen 
synthetisieren konnen wie Pilze der Gattungen Mortierella oder 
Pythium wie Mortierella alpina, Pythium insidiosum oder Pflanzen 
wie Soja, Raps, KokosnuS, Olpalme, Farbersaflor, Rizinus, 

45 Calendula, ErdnuS, Kakaobohne oder Sonnenblume, besonders bevor- 
zugtr werden Soja, Raps, Sonnenblume, Rizinus, Mortierella oder 
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Pythium. Prinzipiell sind als Wirtsorganismen auch trahsgene 
Tiere geeignet beispielsweise C. elegans . 

Nutzbare Wirtszellen sind weiterhin genannt in: Goeddel, Gene 
5 Expression Technology: Methods in Enzymology 185, Academic Press, 
San Diego, CA (1990) . 

Verwendbare Expressionsstainme z.B. solche, die eine geringere 
Proteaseaktivitat aufweisen sind beschrieben in: Gottesman, S., 
10 Gene Expression Technology: Methods in Enzymology 185, Academic 
Press, San Diego, California (1990) 119-128. 

Dabei kann je nach Wahl des Promotors die Expression des 
A6-Desaturase-Gens spezifisch in den Blattern, in den Samen, den 

15 Knollen oder anderen Teilen der Pflanze erfolgen. Solche Fett- 

sauren, Ole oder Lipide mit A6-Doppelbindungen iiberproduzierenden 
transgenen Pflanzen, deren Vermehrungsgut , sowie deren Pflanzen- 
zellen, -gewebe oder -teile, sind ein weiterer Gegenstand der 
vorliegenden Erfindung. Ein bevorzugter erf indungsgemaSer Gegen- 

20 stand sind transgene Pflanzen beispielsweise Kulturpf lanzen wie 
Mais, Hafer, Roggen, Weizen, Gerste, Mais, Reis, Soja, Zucker- 
rube, Canola, Triticale, Sonnenblume , Flachs, Hanf, Tabak, 
Tomate, Kaffee, Kakao, Tee, Karotte, Paprika, Raps, Tapioka, 
Maniok, Pfeilwurz, Tagetes, Alfalfa, Salat und den verschiedenen 

25 Baum-, Nu£- und Weinspezies, Kartoffel, insbesondere 01-haltige 
Kulturpf lanzen, wie Soja, ErdnuS, Rizinus, Borretsch, Lein, 
Sonnenblume, Canola, Baumwolle, Flachs, Raps, KokosnuS, Slpalme, 
Farbersaflor (Carthamus tinctorius) oder Kakaobohne, Testpf lanzen 
wie Arabidopsis oder sonstige Pflanzen wie Moose oder Algen exit- 

30 haltend eine erf indungsgemaSe funktionelle Nukleinsauresequenz 
oder eine funktionelle Expressionskassette. Funktionell bedeutet 
hierbei, daS ein enzymatisch aktives Enzym gebildet wird. 

Die Expressionskassette oder die erf indungsgemafien Nukleinsaure- 
35 seguenzen enthaltend eine A6-Desaturasegensequenz kann dariiber 
hinaus auch zur Transformation der oben beispielhaft genannten 
Organismen wie Bakterien, Cyanobakterien, filamentosen Pilzen, 
Ciliaten, Tiere oder Algen mit dem Ziel einer Erhohung des 
Gehaltes an Fettsauren, Olen oder Lipiden A6-Doppelbindungen ein- 
40 gesetzt werden. Bevorzugte transgene Organismen sind Bakterien, 
Cyanobakterien, filamentose Pilze oder Algen. 

Unter transgenen Organismen sind Organismen zu verstehen, die 
eine Fremde aus einem anderen Organismus stammende Nuklein- 
45 saure, die fur eine im erf indungsgemafien Verfahren verwendete 
A6-Desaturase codiert, enthalten. Unter transgenen Organismen 
sind auch Organismen zu verstehen, die eine Nukleinsaure, die 
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aus demselben Organismus stammt und die fur eine A6-Desaturase 
codiert, enthalt, wobei diese Nukleinsaure als zusatzliche 
Genkopie enthalten ist oder nicht in der naturlichen Nuklein- 
saureumgebung des A6-Desaturase-Gens enthalten ist. Transgene 
5 Organismen sind auch Organismen bei denen die natiirliche 3'- 
und/oder 5 ' -Region des A6-Desaturase-Gens durch gezielte gen- 
technologische Veranderungen gegeniiber dem Ausgangsorganismus 
verandert wurde. Bevorzugt sind transgene Organismen bei denen 
eine Fremd-DNA eingebracht wurde. Besonders bevorzugt sind trans- 
10 gene Pflanzen, in die Fremd-DNA eingebracht wurde. Unter trans- 
genen Pflanzen sind einzelne Pf lanzenzellen und deren Kulturen 
wie beispielsweise Kalluskulturen auf Festmedien oder in Flussig- 
kultur, Pf lanzenteile und ganze Pflanzen zu verstehen. 

15 Ein weiterer Erf indungsgegenstand sind transgene Organismen 
ausgewahlt aus der Gruppe Pflanzen, Pilze, Ciliaten, Algen, 
Bakterien, Cyanobakterien oder Tiere, bevorzugt transgene 
Pflanzen oder Algen, die mindestens eine isolierte Nukleinsaure- 
sequenz enthalten, die fur ein Polypeptid mit A6-Desaturase- 

20 aktivitat codiert, ausgewahlt aus der Gruppe: 



a) einer Nukleinsauresequenz mit der in SEQ ID NO: 1 dar- 
gestellten Sequenz, 

25 b) Nukleinsauresequenzen, die sich als Ergebnis des degenerier- 
ten genetischen Codes von der in SEQ ID NO: 1 ableiten 

c) Derivate der in SEQ ID NO: 1 dargestellten Nukleinsaure- 
sequenz, die fur Polypeptide mit der in SEQ ID NO: 2 
30 dargestellten Aminosauresequenzen codieren und mindestens 

50 % Homologie auf Aminosaureebene aufweisen, ohne da£ die 
enzymatische Wirkung der Polypeptide wesentlich reduziert 
ist. 



35 Erhohung des Gehaltes von Fettsauren, Olen oder Lipiden mit 

A6-Doppelbindungen bedeutet im Rahmen der vorliegenden Erfindung 
beispielsweise die kiinstlich erworbene Fahigkeit einer erhohten 
Biosyntheseleistung durch funktionelle Uberexpression des 
A6-Desaturase-Gens in den erf indungsgemaSen Organismen vorteil- 

40 haft in den erf indungsgemaSen transgenen Pflanzen gegeniiber den 
nicht gentechnisch modif izierten Ausgangspf lanzen zumindest fiir 
die Dauer mindestens einer Pf lanzengeneration. 

Der Biosyntheseort von Fettsauren, Olen oder Lipiden beispiels- 
45 weise ist im allgemeinen der Samen oder Zellschichten des Samens, 
so da£ eine samenspezif ische Expression des A6-Desaturase-Gens 
sinnvoll ist. Es ist jedoch naheliegend, daS die Biosynthese 
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von Fettsauren, Olen oder Lipiden nicht auf das Samengewebe 
beschrankt sein mu£, sondern auch in alien ubrigen Teilen der 
Pflanze - beispielsweise in Epidermis zellen oder in den Knollen 
-gewebe spezifisch erfolgen kann. 

5 

Daruberhinaus ist eine konstitutive Expression des exogenen 
A6-Desaturase-Gens von Vorteil. Andererseits kann aber auch 
eine induzierbare Expression wunschenswert erscheinen. 

10 Die Wirksamkeit der Expression des A6-Desaturase-Gens kann 

beispielsweise in vitro durch SproSmeristemvermehrung ermittelt 
werden. Zudem kann eine in Art und Hohe veranderte Expression 
des A6-Desaturase-Gens und deren Auswirkung auf die Fettsaure-, 
01- oder Lipidbiosyntheseleistung an Testpflanzen in Gewachshaus- 

15 versuchen getestet werden. 

Gegenstand der Erf indung sind wie oben beschrieben transgene 
Pflanzen, trans formiert mit einer Nukleinsauresequenz , die fiir 
eine A6-Desaturase codiert, einem Vektor oder einer Expressions- 
20 kassette enthaltend eine A6-Desaturase-Gensequenz oder mit dieser 
hybridisierende DNA-Sequenzen, sowie transgene Zellen, Gewebe, 
Teile und Vermehrungsgut solcher Pflanzen. Besonders bevorzugt 
sind dabei transgene Kulturpf lanzen wie oben beschrieben. 

25 Pflanzen im Sinne der Erfindung sind mono- und dikotyle Pflanzen 
oder Algen. 



Weitere Gegenstande der Erfindung sind: 



30 - Verwendung einer A6-Desaturase~DNA-Gensequenz mit der in 

SEQ ID N0:1 genannten Sequenz oder mit dieser hybridisierende 
DNA-Secjuenzen zur Herstellung von Pilzen, Bakterien, Tieren 
oder Pflanzen bevorzugt Pflanzen mit erhdhtem Gehalt an 
Fettsauren, Olen oder Lipiden mit A6-Doppelbindungen durch 

35 Expression dieser A6-Desaturase DNA-Sequenz in Pflanzen. 



Verwendung der Proteine mit den Sequenzen SEQ ID NO: 2 zur 
Herstellung von ungesattigten Fettsauren in Pflanzen, Pilzen 7 
Bakterien oder Tieren bevorzugt Pflanzen. 



45 
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Die Erfindung wird durch die folgenden Beispiele naher erlautert : 
Beispiele 

5 Beispiel 1: Allgemeine Klonierungsverf ahren und Anzuchts- 
verfahren: 

Die Klonierungsverfahren wie z.B. Restriktionsspaltungen, 
Agarose-Gelelektrophorese, Reinigung von DNA- Fragment en, Transfer 

10 von Nukleinsauren auf Nitrozellulose und Nylon Membranen, Ver- 
knupfen von DNA-Fragmenten, Transformation von Escherichia coli 
Zellen, Anzucht von Organismen und die Sequenzanalyse rekombinan- 
ter DNA wurden wie bei Sambrook et al . (1989) (Cold Spring Harbor 
Laboratory Press: ISBN 0-87969-309-6) beschrieben durchgef uhrt . 

15 Das Protonema von Physcomitrella patens (= P. patens) wurde in 
Fliissigmedium, wie von Reski et al . (Mol. Gen. Genet., 244, 1994: 
352-359) beschrieben, angezogen. 

Beispiel 2: Sequenzanalyse rekombinanter DNA 

20 

Die Sequenzierung rekombinanter DNA-Molekule erfolgte mit einem 
Laserf luoreszenz-DNA-Sequenzierer der Firma ABI nach der Methode 
von Sanger (Sanger et al. (1977) Proc. Natl. Acad. Sci . USA74, 
5463-5467) . Fragmente resultierend aus einer Polymerase Ketten- 
25 reaktion wurden zur Vermeidung von Polymerasef ehlern in zu 
exprimierenden Konstrukten sequenziert und iiberpruf t . 

Beispiel 3: Lipidanalyse aus dem Protonema von P. patens und 
aus Hefezellen 

30 

Die Lipide wurden mit Chlorof orm/Methanol wie bei Siebertz 

et al. (Eur. J. Biochem. , 101, 1979: 429-438) beschrieben 

aus dem Protonema von S. patens oder aus Hefezellen extrahiert 

und iiber Diinnschichtchromatographie (= TLC) mit Diethylether ge- 

35 reinigt. Die erhaltenen Fettsauren wurden zu den entsprechenden 
Methylestern transmethyliert und mit Gaschromatographie (= GO 
analysiert. Die verschiedenen Methylester wurden mit den ent- 
sprechenden Standards identif iziert . Entsprechende Fettsaure- 
pyrrolididen wurden, wie bei Anderson et al . (Lipids, 9, 1974: 

40 185-190) beschrieben, erhalten und mit GC-MS bestimmt. 



45 
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Funktionelle Expression der A6-Desaturase cDNA von 
P. patens in Hefen 



Die Expression-Experimente in Hefen wurden mit PPDES6-cDNA durch- 
5 gefiihrt. Knock-out-Exprimente batten gezeigt (Daten und Versuchs- 
durchfiihrung nicht gezeigt bzw. beschrieben) , daS der Knock-out 
zu einem Verlust an 2 0 : 3 11 • 14 ' 17 - , 20:4 s ' 8 ' 11 ' 14 -, 20:4 s ' 11 ' 14 ' 17 - und 
20:5 5 ' 8 ' 11 ' 14 < 17 -Fettsauren f tihrt . Gleichzeitig steigen die 18 :2 9 ' 12 - 
und 18 : 3 9 ' 12 ' 15 -Fettsauren an. Fur. die Expression in Hefe wurde 

10 der PPDES6-CDNA in den Hef e-Expressionsvektor pYES2 (Invitrogen) 
subkloniert. Der erhaltene Vektor erhielt die Bezeichnung pYES- 
delta6. Mit pYES2 (Kontrolle) und pYESdelta6 ( A6-Desaturase-cDNA) 
trans formierte Hefekulturen wurden auf Uracil-dop-out Medium mit 
2 % Raffinose und 1 % Tergitol NP-40 (zur Stabilisierung der 

15 Fettsauren) angezogen. Fur die Expression wurden die Zellen mit 
Galactose (Endkonzentration 2 %) bis zu einer optischen Dichte 
(= OD) von 0,5 bei 600nm angezogen. In Fiitterungsexperimenten 
wurden Fettsauren in 5 % Tergitol solubilisiert und mit einer 
Endkonzentration von 0,0003 % zugesetzt. Die Ergebnisse der 

20 Expression sind Tabelle I zu entnehmen. Die Synthese von Fett- 
sauren mit einer Doppelbindung an Position 6 ist nur in Gegenwart 
des Expressionskonstrukts mit der A6-Desaturase-cDNA moglich. 
Dieses A6-Desaturase-Enzym hat eine groSere Aktivitat gegenuber 
Fettsauren, die schon eine Doppelbindung an Position 9 oder 12 

25 (Bezug auf Kohlenstof f atom in der Kette) enthalten. Es wurden 
die Fettsauremethylester des gesamten Lipids der Hefen mit GC 
analysiert. Die einzelnen synthetisierten Fettsauren werden in 
der Tabelle in Mol-% der gesamten Fettsauren angegeben. 



30 Tabelle I: 



Fettsaurezusammensetzung in transf ormierten Hefen 
gegemiber der Kontrolle 



Gesamt Fettsaure (%) 




PYES2 




pYESdelta6 




Fettsauren 






+ 18:29.12 


+18:39.12,15 


16:0 


16,4 


16,1 


23,8 


25,8 


16:19 


54,0 
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4,0 
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8,5 




18:36,9,12 






4,0 




1 8:3 9,12,15 








11,7 


18:46,9,12,15 








3,0 



35 



40 



45 



WO 01/02591 

Beispiel 5: 



28 

Transformation von P. patens 



PCT7EP00/06223 



Die Polyethylenglycol vermittelte direkte DNA-Transf ormation von 
Protoplasten wurde, wie von Schafer et al. (Mol. Gen. Genet., 
5 226, 1991: 418-424) beschrieben, durchgef iihrt . Die Selektion 
der Trans fonrianten erfolgte auf G418-enthaltenden Medium (Girke 
et al., The Plant Journal, 15, 1998: 39-48). 

Beispiel 6 : Isolierung von A6-Desaturase cDNA und genomischen 
10 Clonen von P. patens 

Mit Hilfe eines PCR-Ansatzes mit den folgenden degenerierten 
Oligonukleotiden als Primer: 

15 A: TGGTGGAA (A/G) TGGA (C/A) ICA (T/C) AA und 

B: GG(A/G) AA ( A/C/G/T ) A ( A/G) (G/A) TG (G/A) TG (C/T) TC] 

und dem folgenden Temper aturprogramm: 

94°C, 3 min; [94°C, 20 sec; 45°C, 30 sec; 72°C, 1 min] , 30 Zyklen; 

20 72°C, 5 min, wurden schlieSlich Fragmente einer A6-Desaturase-Gen 
kloniert. Fur die Klonierung wurde poly(A)RNA aus 12 Tage alten 
P. patens Protonema-Kultur isoliert. Mit dieser poly(A)RNA wurde 
die oben beschriebene PCR durchgef iihrt . Fragmente der erwarteten 
Fragmentlange (500 bis 600 bp) wurden in pUC18 kloniert und 

25 sequenziert . Die abgeleitete Aminosauresequenz eines PCR-Frag- 

ments zeigte Ahnlichkeiten zu bekannten A6-Desaturasen. Da bekannt 
war, daS P. patens eine A6-Desaturase besitzt, wurde angenommen, 
da£ dieser Klon fur einen Teil einer A6-Desaturase codiert. 

30 Ein vollstandiger cDNA-Klon (= PPDES6-CDNA) wurde aus einer 

P. patens c DMA- Bank von 12 Tage alten Protonemata mit Hilfe des 
oben genannten PCR-Fragments isoliert. Die Nukleotidsequenz wird 
in SEQ ID NO:l wiedergegeben. Die abgeleitete Aminosauresequenz 
ist SEQ ID NO: 2 zu entnehmen. Die zugehorige genomische Sequenz 

35 (= PPDES6-Gen) konnte mit Hilfe der PCR und den folgenden Oligo- 
nukleotiden als Primer isoliert werden: 

C: CCGAGTCGCGGATCAGCC 

D: CAGTACATTCGGTCATTCACC : 

40 

Tabelle II gibt die Ergebnisse des Vergleichs zwischen der neuen 
P. patens A6-Desaturase iiber die gesamte Nukleinsauresequenz mit 
folgenden bekannten A6-Desaturase wieder: Borago officinalis 
(U79010) , Synechocystis sp (L11421) , Spirulina platensis 
45 (X87094) , Caenorhabiditis elegans (AF031477), Mortierella alpina 
(WO 98/46764), Homo sapiens (Cho et al . , J. Biol. Chem. , 274, 
1999: 471-477), Rattus norvegicus (AB021980) und Mus musculus 
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(Cho et al., J . Biol. Chem. , 274, 1999: 471-477). Die Analyse 
wurde mit dem Gap Programm (GCG-Package, Version 9,1) und den 
folgenden Analysenparametern durchgefiihrt : scoring matrix, 
blosum62, gap creation penalty, 12; gap extension penalty, 4. 
Die Ergebnisse geben die bestimmte Identitat oder Ahnlichkeit [] 
in Prozent (%) izn Vergleich zur P. patens-Sequenz wieder. 



Tabelle II: Sequenzvergleich zwischen P. patens A6-Desaturase und 
anderen A6-Desaturasen 

10 



15 



Sequenz 


Aminosauresequenz- Identitat 
[Ahnlichkeit] (%) 


Borago officinalis 


31 [38] 


Synechocystis sp. 


21 [29] 


Spirulina platensis 


20 [29] 


Caenorhabditis elegans 


35 [43] 


Mortierella alpina 


39 [47] 


Homo sapiens 


27 [38] 


Rattus norvegicus 


28 [39] 


Mus mus cuius 


| 29 [39] 



Beispiel 7 : Klonierung der A6-Desaturase aus Physcomitrella 
25 patens 

Die genomische A6-Acyllipid-Desaturase aus Physcomitrella patens 
wurde auf Grundlage der verof f entlichten Sequenz (Girke et al., 
Plant J., 15, 1998: 39-48) mittels Polymerasekettenreaktion und 
30 Klonierung modifiziert, isoliert und fur das erf indungsgemafce 
Verfahren eingesetzt. Dazu wurde zunachst mittels Polymerase- 
kettenreaktion unter Verwendung von zwei genspezif ischen Primern 
ein Desaturase-Fragment isoliert und in das bei Girke et al. 
(siehe oben) beschriebene Desaturasegen eingesetzt. 

35 

Primer TG5 : 5'- ccgctcgagcgaggttgttgtggagcggc und 
Primer TG3 : 5 x -ctgaaatagtcttgctcc-3 ' 

dienten zunachst zur Amplif izierung eines Genf ragmentes mittels 
40 Polymerasekettenreaktion (30 Zyklen, 30 sek. 94° V, 30 sek. 50°C, 
60 sek. 72°C, 10 min Nachinkubation bei 72°C, in einem Perkin 
Elmer Thermocycler ) . 



45 
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Klonierung eines Expressionsplasmids , das die A6-Desaturase 
unter Kontrolle des 35S CaMV Promotors exprimiert: 

Durch Primer TG5 wurde eine Xhol Schnittstelle in das 
Fragment eingefiihrt. Ein XhoI/Eco47III Fragment wurde durch 
Restriktion erhalten und in die bei Girke et al . beschriebene 
PPDES6-Gensequenz nach analoger Restriktion mit XhoI/Eco47III 
ausgetauscht . Das Konstrukt erhielt den Namen pZK. Das Insert 
von pZK wurde als Xhol/Hindlll Fragment nach Auffullen der 
Hindlll-Schnittstelle mit Nukleotiden durch Behandlung mit 
dem Klenow Fragment der DNA Polymerase I in die Xhol/Smal 
Schnittstellen von pRT9 9/3 5S kloniert . Das resultierende 
Plasmid pSK enthalt den 35S-Promotor [Caulif lower-Mosaik-Vi- 
rus, Franck et al . (1980) Cell 21, 285], die A6-Desaturase 
aus Moos und den 35S-Terminator im Vektor pRT. 

Konstruktion eines Expressionskonstruktes unter Kontrolle 
des Napin-Promotors : 

Durch Schneiden des Plasmides pSK mit Xhol, Behandlung mit 
T4-DNA Polymerase und Pstl-Restriktion wurde das erhaltene 
Promotor-Desaturase-Fragment mit Terminator in den Vektor 
pJH3 kloniert. Dazu wurde der Vektor BamHI geschnitten und 
mit Klenow-Enzym die Uberhange aufgefullt sowie anschlieSend 
mit PstI nachgeschnitten. Es entstand durch Ligation des 
Desaturase-Terminator-Fragmentes in den Vektor das Plasmid 
pJH7, das einen Napin-Promotor beinhaltet (Scofield et al . , 
1987, J. Biol. Chem. 262, 12202-8). Die Expressionskassette 
aus pJH7 wurde mit Bspl20I und NotI geschnitten und in den 
binaren Vektor pRE kloniert. Es entstand das Plasmid pRE- 
Ppdes6 . 

In einer PCR Reaktion wurde die erf indungsgemaSe 
A6-Desaturase cDNA aus P. patens als Matrize verwendet. 
Mithilfe der nachfolgend aufgefuhrten Oligonukleotide wurde 
eine BamHI-Restriktionsschnittstelle vor dem Startcodon und 
drei Adeninnukleotide als Konsensustranslationssequenz fur 
Eukaryoten in die A6-Desaturase cDNA eingefiihrt . Es wurde 
ein 1512 Basenpaar fragment der A6-Desaturase amplifiziert 
und seguenziert. 

Pp-d6Desl: 5 ' - CC GGTACC aaaatggtattcgcgggcggtg -3' 
Pp-d6Des2: 3 ' - CC GGTACC ttaactggtggtagcatgct -3' 

Die Reaktionsgemische enthielten ca. 1 ng/micro 1 Matrizen 
DNA, 0,5 \m der Oligonukleotide und, 200 JIM Desoxy-Nukleotide 
(Pharmacia), 50 mM KCl, 10 mM Tris-HCl (pH 8,3 bei 25°C, 
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1,5 mM MgCl 2 ) und 0,02 U/JJ.1 Pwo Polymerase (Boehringer Mann- 
heim) und werden in einer PCR-Mas chine der Firma Perkin Elmer 
mit folgendem Temperaturprograiran inkubiert: 

5 Anlagerungstemperatur : 50°C, 30 sec 

Denaturierungstemperatur : 95°C, 30 sec 

Elongations temperatur : 72°C, 90 sec 

Anzahl der Zyklen: 3 0 

10 c) Konstruktion eines Expressionskonstruktes unter Kontrolle des 
USP-Promotors : 

Das erhaltene Fragment von ca. 1,5 kB Basenpaaren wurde in 
den mit EcoRV gespaltenen Vektor pBluescript SK- (Stratagene) 
ligiert und stand fur weitere Klonierungen als BamHI Fragment 
15 zur Verfiigung. 

Fur die Transformation von Pflanzen wurde ein weiterer Trans- 
f ormationsvektor auf Basis von pBin-USP erzeugt, der das 
BamHI-Fragment der A6-Desaturase enthalt. pBin-USP ist ein 

20 Derivat des Plasmides pBinl9. pBinUSP entstand aus pBinl9, 

indem in pBinl9 [Bevan et al . (1980) Nucl. Acids Res. 12, 
8711] ein USP-Promotor als EcoRI-BaMHI-Fragment inseriert 
wurde. Das Polyadenylierungssignal ist das des Gens 3 der 
T-DNA des Ti-Plasmides pTiACHS (Gielen et al . , (1984) EMBO 

25 J. 3, 835), wobei Nukleotide 11749-11939 als PvuII-Hindlll- 

Fragment isoliert und nach Addition von Sphl-Linkern an die 
PvuII-Schnittstelle zwischen die SpHI-Hindlll Schnittstelle 
des Vektors kloniert. Der USP-Promotor entspricht den Nukleo- 
tiden 1-684 (Genbank Accession X56240) , wobei ein Teil der 

30 nichtcodierenden Region des USP-Gens im Promotor enthalten 

ist. Das 684 Basenpaar gro£e Promotorf ragment wurde mittels 
kauf lichen T7-Standardprimer (Stratagene) und mit Hilfe eines 
synthetisierten Primers iiber eine PCR-Reaktion nach Standard- 
methoden amplifiziert ( Primers equenz : 5 ' -GTCGACCCGCGGACTAGTG- 

35 GGCCCTCTAGACCCGGGGGATCC GGATCTGCTGGCTATGAA-3 ' ) . Das PCR- 

Fragment wurde mit EcoRI/Sall nachgeschnitten und in den 
Vektor pBinl9 mit OCS Terminator eingesetzt. Es entstand 
das Plasmid mit der Bezeichnung pBinUSP. 

40 d) Konstruktion eines Expressionskonstruktes unter Kontrolle 
des vATPase-Cl-Promotors aus Beta vulgaris : 

Analog zum Expressionsplasmid mit dem USP-Promotor wurde 
ein Konstrukt unter Verwendung des v-ATPase-cl-Promotors 
45 erstellt. Der Promotor wurde als EcoRI/Kpnl Fragment in das 

Plasmid pBinl9 mit OCS Terminator kloniert und tiber BamHI 
das A6-Desaturasegen aus P. patens zwischen Promotor und 
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Terminator inseriert. Der Promotor entspricht einem 1153 
Basenpaarfragment aus beta-Vulgaris (Plant Mol Biol, 1999, 
39:463-475) . 

5 Das Konstrukt wurde zur Transformation von Arabidopsis 

thaliana und Rapspf lanzen eingesetzt . 

Beispiel 8 : Erzeugung transgener Rapspf lanzen (verandert nach 

Moloney et al., 1992, Plant Cell Reports, 8:238-242) 

10 

Zur Erzeugung transgener Rapspflanzen wurden binare Vektoren in 
Agrobacterium tumefaciens C58C1 :pGV2260 oder Escherichia coli 
genutzt (Deblaere et al , 1984, Nucl. Acids. Res. 13, 4777-4788). 
Zur Transformation von Rapspflanzen (Var. Drakkar, NPZ Nord- 

15 deutsche Pf lanzenzucht , Hohenlieth, Deutschland) , wurde eine 1:50 
Verdunnung einer Ubernachtkultur einer positiv trans f ormierten 
Agrobakterienkolonie in Murashige-Skoog Medium (Murashige und 
Skoog 1962 Physiol. Plant. 15, 473) mit 3 % Saccharose (3MS- 
Medium) benutzt. Petiolen oder Hypokotyledonen frisch gekeimter 

20 steriler Rapspflanzen (zu je ca. 1 cm 2 ) wurden in einer Petri - 
schale mit einer 1:50 Agrobakterienverdunnung fur 5-10 Minuten 
inkubiert . Es folgte eine 3-tagige Inkubation in Dunkelheit bei 
25°C auf 3MS-Medium mit 0,8 % Bacto-Agar. Die Kultivierung wurde 
nach 3 Tagen mit 16 Stunden Licht/8 Stunden Dunkelheit weiter- 

25 gefiihrt und in wochentlichem Rhythmus auf MS-Medium mit 500 mg/1 
Claforan (Cef otaxime-Natrium) , 50 mg/1 Kanamycin, 20 \m Benzyl- 
aminopurin (BAP) und 1,6 g/1 Glukose weitergefiihrt . Wachsende 
Sprosse wurden auf MS-Medium mit 2 % Saccharose, 250 mg/1 
Claforan und 0,8 % Bacto-Agar iiberfuhrt. Bildeten sich nach 

30 drei Wochen keine Wurzeln, so wurde als Wachstumshormon 2-Indol- 
buttersaure zum Bewurzeln zum Medium zugegeben. 

Regenerierte Sprosse wurden auf 2MS-Medium mit Kanamycin 

und Claforan erhalten, nach Bewurzelung in Erde iiberfuhrt und 

35 nach Kultivierung fur zwei Wochen in einer Klimakammer oder im 
Gewachshaus angezogen, zur Blute gebracht, reife Samen geerntet 
und auf A6-Desaturase-Expression mittels Lipidanalysen unter- 
sucht. Linien mit erhohten Gehalten an oder Doppelbindungen an 
der A6-Position wUrden identif iziert . Es konnte in den stabil 

40 transf ormierten transgenen Linien, die das Transgen funktionell 
exprimierten, ein erhohter Gehalt von Doppelbindungen an 
der A6-Position im Vergleich zu untransf ormierten Kontroll- 
pf lanzen feststellt werden. 
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Beispiel 8 : Lipidextraktion aus Samen 

Das Pf lanzenmaterial wurde zunachst mechanisch durch Morsern 
homogenisiert , um es einer Extraktion zuganglicher zu machen. 

5 

Daxm wurde es 10 min bei 100°C abgekocht und nach dem Abkiihlen auf 
Eis sediment iert . Das Zellsediment wurde mit 1 N methanolischer 
Schwef elsaure und 2 % Dimethoxypropan Ih bei 90°C hydrolysiert 
und die Lipide transmethyliert . Die resultierenden Fettsaure- 

10 methylester (FAME) wurden in Petrolether extrahiert. Die extra- 
hierten FAME wurden durch Gasf liissigkeitschromatographie mit 
einer Kapillarsaule (Chrompack, WCOT Fused Silica, CP-Wax-52 CB, 
25 m, 0,32 mm) und einem Temperaturgradienten von 170°C auf 240°C 
in 20 min und 5 min bei 240°C analysiert. Die Identitat der Fett- 

15 sauremethylester wurde durch Vergleich mit entsprechenden FAME- 
Standards (Sigma) bestatigt. Die Identitat und die Position der 
Doppelbindung konnte durch geeignete chemische Derivatisierung 
der FAME-Gemische z.B. zu 4 , 4-Dimethoxyoxazolin-Derivaten 
(Christie, 1997, in: Advances in Lipid Methodology, 4. Auflage: 

20 Christie, Oily Press, Dundee, 119-169, und 1998, Gaschromato- 

graphie-Massenspektrometrie Verf ahren, Lipide 33:343-353) mittels 
GC-MS weiter analysiert werden. Die GC-Analysen der Fettsaure- 
methylester aus den transgenen Rapssamen, die samenspezif isch die 
A6-Desaturase exprimierten sind in Tabelle III dargestellt. Die 

25 transgenen Rapssamen weisen mindestens 4,95 % y-Linolensaure im 
Samen auf. 

Tabelle III gibt die GC-Analysen der Fettsauremethylester 
aus reifen, transgenen Rapssamen, die A6-Desaturase samen- 
30 spezifisch exprimieren, wieder. Die Fettsaurezusammensetzung ist 
in [mol %] der Gesamtf ettsauren angegeben. Es ist f estzustellen, 
da& einzelne Pflanzen der T2 Generation, die aus positiv trans- 
form! erten und geselbsteten Pflanzen erhalten wurden, bis zu ca. 
4,95 % y-Linolensaure enthalten. 
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Tabelle III: GC-Analysen der Fettsauremethylester von Raps 



Bezeichnung 


18:0 


18:1 


18 :2 


18:3 (7) 


18 : 3 (a) 


18 : 4 
















R2-T2-ll/la 


1, 98 


53 , 58 


22 , 63 


3 , 86 


i 1 r> 

11, 38 


0 


R2-T2-ll/lb 


1, 86 


52, 04 


25 , 45 


2,31 


1 1 "in 

11, 39 


0 


R2-T2-ll/lc 


1, 95 


49, 17 


24, 30 


2 , 84 


9,20 


□ 


R2-T2-11/3 


1, 82 


49 , 83 


24 , 54 


*a q q 
J, oo 


in i o 


A 
U 


R2-T2-11/4 


1, 72 


48,02 


24, 66 


4,95 


9,52 


0 


R2-T2-ll/5a 


1,73 


51,98 


25,27 


4,27 


9, 61 


0 


R2-T2-ll/5b 


2,02 


56,19 


25, 08 


0 


9, 33 


0 


R2-T2-ll/5c 


2, 01 


46,95 


27, 38 


0 


10, 37 


0 


R2-T2-ll/5d 


1,83 


49,49 


24,15 


4,40 


8, 65 


0 


R2-T2-11/6 


2, 08 


54,52 


23,94 


2,05 


9,29 


0 


R2-T2-11/10 


1,94 


53,92 


22, 81 


4,06 


9,44 


0 


R2-T2-WT 


1,90 


47,75 


30,91 


0 


10, 51 


0 
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Patentanspriiche 



1. Verfahren zur Herstellung von ungesattigten Fettsauren, da- 
5 durch gekennzeichnet , daS mindestens eine isolierte Nuklein- 

sauresequenz, die fur ein Polypeptid mit A6-Desaturase- 
aktivitat codiert, ausgewahlt aus der Gruppe : 



a) einer Nukleinsauresequenz mit der in SEQ ID NO: 1 dar- 
10 gestellten Seguenz , 

b) Nukleinsauresequenzen, die sich als Ergebnis des 
degenerierten genet ischen Codes von der in SEQ ID NO: 1 
ableiten 

15 

c) Derivate der in SEQ ID NO: 1 dargestellten Nukleinsaure- 
sequenz, die fur Polypeptide mit der in SEQ ID NO: 2 dar- 
gestellten Amino s aures equenz en codieren und mindestens 

50 % Homologie auf Aminosaureebene aufweisen, ohne da£ 
20 die enzymatische Wirkung der Polypeptide wesentlich 

reduziert ist, 



in einen Organismus eingebracht wird, dieser Organismus 
angezogen wird, wobei der angezogene Organismus mindestens 
25 1 Mol-% ungesattigte Fettsauren bezogen auf den gesamten 

Fettsauregehalt im Organismus enthalt. 



2. Verfahren nach Anspruch 1, dadurch gekennzeichnet, da£ die 
Nukleinsauresequenz von einer Pflanze oder Alge stammt. 

30 

3. Verfahren nach Anspruch 1 oder 2, dadurch gekennzeichnet, 
da£ die Nukleinsauresequenz von Physcomitrella patens stammt . 

4. Verfahren nach den Anspriichen 1 bis 3, dadurch gekenn- 

35 zeichnet, daS es sich bei dem Organismus um ein organismus 

ausgewahlt aus der Gruppe Bakterium, Pilz, Ciliat, Alge, 
Cyanobakterium, Tier oder Pflanze handelt. 

5. Verfahren nach den Anspriichen 1 bis 4, dadurch gekenn- 
40 zeichnet, da& es sich bei dem Organismus um eine Pflanze 

oder Alge handelt. 
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5 7. 



10 8. 



9. 

15 



20 



25 



Verfahren nach den Anspriichen 1 bis 5, dadurch gekenn- 
zeichnet, da£ es sich bei dem Organismus urn eine Olfrucht- 
pflanzen handelt. 

Verfahren nach den Anspriichen 1 bis 6, dadurch gekenn- 
zeichnet, daS der angezogene Organismus mindestens 5 Gew-% 
ungesattigte Fettsauren bezogen auf den gesamten Fettsaure- 
gehalt im Organismus enthS.lt. 

Verfahren nach den Anspriichen 1 bis 7, dadurch gekenn- 
zeichnet, daS die ungesattigten Fettsauren aus dem Organismus 
isoliert werden. 

Transgener Organismus ausgewahlt aus der Gruppe Pflanzen, 
Pilze, Ciliaten, Algen, Bakterien, Cyanobakterien oder Tiere, 
die mindestens eine isolierte Nukleinsauresequenz enthalten, 
die fur ein Polypeptid mit A6-Desaturaseaktivitat codiert, 
ausgewahlt aus der Gruppe: 

a) einer Nukleinsauresequenz mit der in SEQ ID NO: 1 
dargestellten Sequenz , 

b) Nukleinsaureseguenzen, die sich als Ergebnis des 
degenerierten genetischen Codes von der in SEQ ID NO: 1 
ableiten 

c) Derivate der in SEQ ID NO: 1 dargestellten Nukleinsaure- 
sequenz, die fur Polypeptide mit der in SEQ ID NO: 2 dar- 
gestellten Aminosauresequenzen codieren und mindestens 

5 0 % Homologie auf Aminosaureebene aufweisen, ohne da£ 
die enzymatische Wirkung der Polypeptide wesentlich 
reduziert ist. 

Transgener Organismus nach Anspruch 9, dadurch gekenn- 
zeichnet, daS es sich bei dem Organismus um eine Pflanze 
oder Alge handelt . 

01, Lipide oder Fettsauren oder eine Fraktion davon, her- 
gestellt durch das Verfahren nach einem der Anspriiche 1 
bis 8. 

Verwendung der 01-, Lipid- oder Fettsaurezusammensetzung nach 
Anspruch 11 oder transgene Organismen nach Anspruch 9 in 
Futtermitteln, Nahrungsmitteln, Kosmetika oder Pharmazeutika . 
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SEQUENZ PROTOKOLL 

<170> Patent In Vers. 2.0 

<210> 1 

<211> 2012 

<212> DNA 

<213> Physcomitrella patens 

<220> 
<221> CDS 

<222> (319) . . (1896) 
<400> 1 

ccgagtcgcg gatcagccat cgcccgccca gggccgcctg cattgtgtgg gacggtgttg 60 

gaggaggagg cagatgcgcg ggcgttggtg gagtcgtcat ccgaggatct actgcggcaa 120 

tacctccggg ttttggagcg ggcaaactct gttgcggctc ggaaggctat aggttcggca 180 

ggagactgtt gattttatgt cgggggcatt gccattgtgg agagcggggg agactcagga 240 

tctgtgagtg tgcgtgcagc gccccgactg ccgcagagcg tctgtgtatg acgaggttgt 3 00 

tgtggagcgg cttttgaa atg gta ttc gcg ggc ggt gga ctt cag cag ggc 351 

Met Val Phe Ala Gly Gly Gly Leu Gin Gin Gly 
15 10 

tct etc gaa gaa aac ate gac gtc gag cac att gec agt atg tct etc 399 
Ser Leu Glu.Glu Asn lie Asp Val Glu His lie Ala Ser Met Ser Leu 
15 20 25 

ttc age gac ttc ttc agt tat gtg tct tea act gtt ggt teg tgg age 447 
Phe Ser Asp Phe Phe Ser Tyr Val Ser Ser Thr Val Gly Ser Trp Ser 
30 35 40 

gta cac agt ata caa cct ttg aag cgc ctg acg agt aag aag cgt gtt 495 
Val His Ser lie Gin Pro Leu Lys Arg Leu Thr Ser Lys Lys Arg Val 
45 50 55 

teg gaa age get gee gtg caa tgt ata tea get gaa gtt cag aga aat 543 
Ser Glu Ser Ala Ala Val Gin Cys lie Ser Ala Glu Val Gin Arg Asn 
60 65 70 75 

teg agt ace cag gga act gcg gag gca etc gca gaa tea gtc gtg aag 591 
Ser Ser Thr Gin Gly Thr Ala Glu Ala Leu Ala Glu Ser Val Val Lys 

80 85 90 

ccc acg aga cga agg tea tct cag tgg aag aag teg aca cac ccc eta 639 
Pro Thr Arg Arg Arg Ser Ser Gin Trp Lys Lys Ser Thr His Pro Leu 
95 100 105 
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tea gaa gta gca gta cac aac aag cca age gat tgc tgg att gtt gta 687 
Ser Glu Val Ala Val His Asn Lys Pro Ser Asp Cys Trp lie Val Val 
110 115 120 

aaa aac aag gtg tat gat gtt tec aat ttt gcg gac gag cat ccc gga 735 
Lys Asn Lys Val Tyr Asp Val Ser Asn Phe Ala Asp Glu His Pro Gly 
125 130 135 

gga tea gtt att agt act tat ttt gga cga gac ggc aca gat gtt ttc 783 
Gly Ser Val lie Ser Thr Tyr Phe Gly Arg Asp Gly Thr Asp Val Phe 
140 145 150 155 

tct agt ttt cat gca get tct aca tgg aaa att ctt caa gac ttt tac 831 
Ser Ser Phe His Ala Ala Ser Thr Trp Lys lie Leu Gin Asp Phe Tyr 
160 165 170 

att ggt gac gtg gag agg gtg gag ccg act cca gag ctg ctg aaa gat 879 
lie Gly Asp Val Glu Arg Val Glu Pro Thr Pro Glu Leu Leu Lys Asp 
175 180 185 

ttc cga gaa atg aga get ctt ttc ctg agg gag caa ctt ttc aaa agt 927 
Phe Arg Glu Met Arg Ala Leu Phe Leu Arg Glu Gin Leu Phe Lys Ser 
190 195 200 

teg aaa ttg tac tat gtt atg aag ctg etc acg aat gtt get att ttt 975 
Ser Lys Leu Tyr Tyr Val Met Lys Leu Leu Thr Asn Val Ala lie Phe 
205 210 215 

get gcg age att gca ata ata tgt tgg age aag act att tea gcg gtt 1023 
Ala Ala Ser lie Ala lie lie Cys Trp Ser Lys Thr lie Ser Ala Val 
220 225 230 235 

ttg get tea get tgt atg atg get ctg tgt ttc caa cag tgc gga tgg 1071 
Leu Ala Ser Ala Cys Met Met Ala Leu Cys Phe Gin Gin Cys Gly Trp 
240 245 250 

eta tec cat gat ttt etc cac aat cag gtg ttt gag aca cgc tgg ctt 1119 
Leu Ser His Asp Phe Leu His Asn Gin Val Phe Glu Thr Arg Trp Leu 
255 260 265 

aat gaa gtt gtc ggg tat gtg ate ggc aac gee gtt ctg ggg ttt agt 1167 
Asn Glu Val Val Gly Tyr Val lie Gly Asn Ala Val Leu Gly Phe Ser 
270 275 280 

aca ggg tgg tgg aag gag aag cat aac ctt cat cat get get cca aat 1215 
Thr Gly Trp Trp Lys Glu Lys His Asn Leu His His Ala Ala Pro Asn 
285 290 295 



gaa tgc gat cag act tac caa cca att gat gaa gat att gat act etc 
Glu Cys Asp Gin Thr Tyr Gin Pro lie Asp Glu Asp He Asp Thr Leu 
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300 305 310 315 

ccc etc att gee tgg age aag gac ata ctg gec aca gtt gag aat aag 1311 

Pro Leu lie Ala Trp Ser Lys Asp lie Leu Ala Thr Val Glu Asn Lys 
320 325 330 

aca ttc ttg cga ate etc caa tac cag cat ctg ttc ttc atg ggt ctg 13 59 

Thr Phe Leu Arg lie Leu Gin Tyr Gin His Leu Phe Phe Met Gly Leu 
335 340 345 

tta ttt ttc gec cgt ggt agt tgg etc ttt tgg age tgg aga tat acc 1407 

Leu Phe Phe Ala Arg Gly Ser Trp Leu Phe Trp Ser Trp Arg Tyr Thr 
350 355 360 

tct aca gca gtg etc tea cct gtc gac agg ttg ttg gag aag gga act 1455 

Ser Thr Ala Val Leu Ser Pro Val Asp Arg Leu Leu Glu Lys Gly Thr 

365 370 375 

gtt ctg ttt cac tac ttt tgg ttc gtc ggg aca gcg tgc tat ctt etc 1503 

Val Leu Phe His Tyr Phe Trp Phe Val Gly Thr Ala Cys Tyr Leu Leu 
380 385 390 395 

cct ggt tgg aag cca tta gta tgg atg gcg gtg act gag etc atg tec 1551 

Pro Gly Trp Lys Pro Leu Val Trp Met Ala Val Thr Glu Leu Met Ser 
400 405 410 

ggc atg ctg ctg ggc ttt gta ttt gta ctt age cac aat ggg atg gag 1599 

Gly Met Leu Leu Gly Phe Val Phe Val Leu Ser His Asn Gly Met Glu 
415 420 425 

gtt tat aat teg tct aaa gaa ttc gtg agt gca cag ate gta tec aca 1647 

Val Tyr Asn Ser Ser Lys Glu Phe Val Ser Ala Gin lie Val Ser Thr 
430 435 440 

egg gat ate aaa gga aac ata ttc aac gac tgg ttc act ggt ggc ctt 1695 

Arg Asp lie Lys Gly Asn lie Phe Asn Asp Trp Phe Thr Gly Gly Leu 

445 450 455 

aac agg caa ata gag cat cat ctt ttc cca aca atg ccc agg cat aat 1743 

Asn Arg Gin lie Glu His His Leu Phe Pro Thr Met Pro Arg His Asn 
460 465 470 475 

tta aac aaa ata gca cct aga gtg gag gtg ttc tgt aag aaa cac ggt 1791 

Leu Asn Lys He Ala Pro Arg Val Glu Val Phe Cys Lys Lys His Gly 
480 485 490 

ctg gtg tac gaa gac gta tct att get acc ggc act tgc aag gtt ttg 1839 

Leu Val Tyr Glu Asp Val Ser He Ala Thr Gly Thr Cys Lys Val Leu 
495 500 505 

aaa gca ttg aag gaa gtc gcg gag get gcg gca gag cag cat get acc 1887 
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Lys Ala Leu Lys Glu Val Ala Glu Ala Ala Ala Glu Gin His Ala Thr 
510 515 520 

acc agt taa cagtctttgg aaagcttggc aattgatctt tattctccac 193 6 

Thr Ser 
525 

ggcagttgct tgtttgtttt ggggtgaatg accgaatgta ctggcatcca ttcttctgta 1996 
gccatcaatt ttgaac 2 012 



<210> 2 
<211> 525 
<212> PRT 

<213> Physcomitrella patens 
<400> 2 

Met Val Phe Ala Gly Gly Gly Leu Gin Gin Gly Ser Leu Glu Glu Asn 
15 10 15 

lie Asp Val Glu His lie Ala Ser Met Ser Leu Phe Ser Asp Phe Phe 
20 25 30 

Ser Tyr Val Ser Ser Thr Val Gly Ser Trp Ser Val His Ser He Gin 
35 40 45 

Pro Leu Lys Arg Leu Thr Ser Lys Lys Arg Val Ser Glu Ser Ala Ala 
50 55 60 

Val Gin Cys He Ser Ala Glu Val Gin Arg Asn Ser Ser Thr Gin Gly 
65 70 75 80 

Thr Ala Glu Ala Leu Ala Glu Ser Val Val Lys Pro Thr Arg Arg Arg 

85 90 95 

Ser Ser Gin Trp Lys Lys Ser Thr His Pro Leu Ser Glu Val Ala Val 
100 105 110 

His Asn Lys Pro Ser Asp Cys Trp He Val Val Lys Asn Lys Val Tyr 
115 120 125 

Asp Val Ser Asn Phe Ala Asp Glu His Pro Gly Gly Ser Val He Ser 
130 135 140 

Thr Tyr Phe Gly Arg Asp Gly Thr Asp Val Phe Ser Ser Phe His Ala 
145 150 155 160 



Ala Ser Thr Trp Lys He Leu Gin Asp Phe Tyr He Gly Asp Val Glu 
165 170 175 
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Arg Val Glu Pro Thr Pro Glu Leu Leu Lys Asp Phe Arg Glu Met Arg 
180 185 190 

Ala Leu Phe Leu Arg Glu Gin Leu Phe Lys Ser Ser Lys Leu Tyr Tyr 
195 200 205 

Val Met Lys Leu Leu Thr Asn Val Ala He Phe Ala Ala Ser He Ala 
210 215 220 

He He Cys Trp Ser Lys Thr He Ser Ala Val Leu Ala Ser Ala Cys 
225 230 235 ■ 240 

Met Met Ala Leu Cys Phe Gin Gin Cys Gly Trp Leu Ser His Asp Phe 
245 250 255 

Leu His Asn Gin Val Phe Glu Thr Arg Trp Leu Asn Glu Val Val Gly 
260 265 270 

Tyr Val He Gly Asn Ala Val Leu Gly Phe Ser Thr Gly Trp Trp Lys 
275 280 285 

Glu Lys His Asn Leu His His Ala Ala Pro Asn Glu Cys Asp Gin Thr 
290 295 300 

Tyr Gin Pro He Asp Glu Asp He Asp Thr Leu Pro Leu He Ala Trp 
305 310 315 320 

Ser Lys Asp He Leu Ala Thr Val Glu Asn Lys Thr Phe Leu Arg He 
325 330 335 

Leu Gin Tyr Gin His Leu Phe Phe Met Gly Leu Leu Phe Phe Ala Arg 
340 345 350 

Gly Ser Trp Leu Phe Trp Ser Trp Arg Tyr Thr Ser Thr Ala Val Leu 
355 360 365 

Ser Pro Val Asp Arg Leu Leu Glu Lys Gly Thr Val Leu Phe His Tyr 
370 375 380 

Phe Trp Phe Val Gly Thr Ala Cys Tyr Leu Leu Pro Gly Trp Lys Pro 
385 390 395 400 

Leu Val Trp Met Ala Val Thr Glu Leu Met Ser Gly Met Leu Leu Gly 
405 410 415 

Phe Val Phe Val Leu Ser His Asn Gly Met Glu Val Tyr Asn Ser Ser 
420 425 430 



Lys Glu Phe Val Ser Ala Gin He Val Ser Thr Arg Asp He Lys Gly 
435 440 445 
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Asn lie Phe Asn Asp Trp Phe Thr Gly Gly Leu Asn Arg Gin lie Glu 
450 455 460 

His His Leu Phe Pro Thr Met Pro Arg His Asn Leu Asn Lys lie Ala 
465 470 475 480 

Pro Arg Val Glu Val Phe Cys Lys Lys His Gly Leu Val Tyr Glu Asp 
485 490 495 

Val Ser lie Ala Thr Gly Thr Cys Lys Val Leu Lys Ala Leu Lys Glu 
500 505 510 



Val Ala Glu Ala Ala Ala Glu Gin His Ala Thr Thr Ser 
515 520 525 
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METHODS AND COMPOSITIONS FOR SYNTHESIS OF 
LONG CHAIN POLYUNSATURATED FATTY ACIDS IN PLANTS 

CROSS-RE FERENCE TO RELATED APPLICATIONS 

This application is a continuation-in-part of USSN 08/834,655, filed 
5 April 11, 1997, and a continuation in part of USSN 08/833,610, filed April 11, 
1997, USSN 08/834,033 filed April 11, 1997 and USSN 08/956,985 filed 
October 24, 1997 which disclosures are incorporated herein by reference. 

INTRODUCTION 

Field of the Invention 

10 This invention relates to modulating levels of enzymes and/or enzyme 

components capable of altering the production of long chain polyunsaturated 
fatty acids (PUFAS) in a host plant. The invention is exemplified by the 
production of PUFAS in plants. 

Background 

15 Two main families of polyunsaturated fatty acids (PUFAs) are the <o3 

fatty acids, exemplified by arachidonic acid, and the co6 fatty acids, exemplified 
by eicosapentaenoic acid. PUFAs are important components of the plasma 
membrane of the cell, where they may be found in such forms as phospholipids. 
PUFAs also serve as precursors to other molecules of importance in human 

20 beings and animals, including the prostacyclins, leukotrienes and 

prostaglandins. PUFAs are necessary for proper development, particularly in 
the developing infant brain, and for tissue formation and repair. 

Four major long chain PUFAs of importance include docosahexaenoic 

acid (DHA) and eicosapentaenoic acid (EPA), which are primarily found in 

25 different types of fish oil, gamma-linolenic acid (GLA), which is found in the 

seeds of a number of plants, including evening primrose (Oenothera biennis), 

borage (Borago officinalis) and black currants (Ribes nigrum), and stearidonic 

acid (SDA), which is found in marine oils and plant seeds. Both GLA and 

another important long chain PUFA, arachidonic acid (ARA), are found in 
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filamentous fungi. ARA can be purified from animal tissues including liver and 
adrenal gland. 

For DHA, a number of sources exist for commercial production 
including a variety of marine organisms, oils obtained from cold water marine 
5 fish, and egg yolk fractions. For ARA, microorganisms including the genera 
Mortierella, Entomophthora, Phytium and Porphyridium can be used for 
commercial production. Commercial sources of SDA include the genera 
Trichodesma and Echium. Commercial sources of GLA include evening 
primrose, black currants and borage. However, there are several disadvantages 
10 associated with commercial production of PUFAs from natural sources. Natural 
sources of PUFAs, such as animals and plants, tend to have highly 
heterogeneous oil compositions. The oils obtained from these sources therefore 
can require extensive purification to separate out one or more desired PUFAs or 
to produce an oil which is enriched in one or more PUFA. Natural sources also 
1 5 are subject to uncontrollable fluctuations in availability. Fish stocks may 
undergo natural variation or may be depleted by overfishing. Fish oils have 
unpleasant tastes and odors, which may be impossible to economically separate 
from the desired product, and can render such products unacceptable as food 
supplements. Animal oils, and particularly fish oils, can accumulate 
20 environmental pollutants. Weather and disease can cause fluctuation in yields 
from both fish and plant sources. Cropland available for production of alternate 
oil-producing crops is subject to competition from the steady expansion of 
human populations and the associated increased need for food production on the 
remaining arable land. Crops which do produce PUFAs, such as borage, have 
25 not been adapted to commercial growth and may not perform well in 

monoculture. Growth of such crops is thus not economically competitive where 
more profitable and better established crops can be grown. Large scale 
fermentation of organisms such as Mortierella is also expensive. Natural 
animal tissues contain low amounts of ARA and are difficult to process. 
30 Microorganisms such as Porphyridium and Mortierella are difficult to cultivate 
on a commercial scale. 
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Dietary supplements and pharmaceutical formulations containing 
PUFAs can retain the disadvantages of the PUFA source. Supplements such as 
fish oil capsules can contain low levels of the particular desired component and 
thus require large dosages. High dosages result in ingestion of high levels of 
5 undesired components, including contaminants. Care must be taken in 

providing fatty acid supplements, as overaddition may result in suppression of 
endogenous biosynthetic pathways and lead to competition with other necessary 
fatty acids in various lipid fractions in vivo, leading to undesirable results. For 
example, Eskimos having a diet high in co3 fatty acids have an increased 
10 tendency to bleed (U.S. Pat. No. 4,874,603). Unpleasant tastes and odors of the 
supplements can make such regimens undesirable, and may inhibit compliance 
by the patient. 

A number of enzymes are involved in PUFA biosynthesis. Linoleic acid 
(LA, 18:2 A9, 12) is produced from oleic acid (18:1 A9) by a A12-desaturase. 

15 GLA (18:3 A6, 9, 12) is produced from linoleic acid (LA, 18:2 A9, 12) by a A6- 
desaturase. ARA (20:4 A5, 8, 1 1, 14) production from DGLA (20:3 A8, 11, 14) 
is catalyzed by a A5-desaturase. However, animals cannot desaturate beyond 
the A9 position and therefore cannot convert oleic acid (18:1 A9) into linoleic 
- acid (18:2 A9, 12). Likewise, cc-linolenic acid (ALA, 18:3 A9, 12, 15) cannot 

20 be synthesized by mammals. Other eukaryotes, including fungi and plants, have 
enzymes which desaturate at positions A21 and A15. The major poly- 
unsaturated fatty acids of animals therefore are either derived from diet and/or 
from desaturation and elongation of linoleic acid (18:2 A9, 12) or oc-linolenic 
acid (18:3 A9, 12, 15). 

25 Poly-unsaturated fatty acids are considered to be useful for nutritional, 

pharmaceutical, industrial, and other purposes. An expansive supply of poly- 
unsaturated fatty acids from natural sources and from chemical synthesis are not 
sufficient for commercial needs. Therefore it is of interest to obtain genetic 
material involved in PUFA biosynthesis from species that naturally produce 

30 these fatty acids and to express the isolated material alone or in combination in 
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a heterologous system which can be manipulated to allow production of 
commercial quantities of PUFAS. 

The present invention is further directed to formulas, dietary 
supplements or dietary supplements in the form of a liquid or a solid containing 
5 the long chain fatty acids of the invention. These formulas and supplements 
may be administered to a human or an animal. 

The formulas and supplements of the invention may further comprise at 
least one macronutrient selected from the group consisting of coconut oil, soy 
oil, canola oil, mono- and diglycerides, glucose, edible lactose, electrodialysed 
10 whey, electrodialysed skim milk, milk whey, soy protein, and other protein 
hydrolysates. 

The formulas of the present invention may further include at least one 
vitamin selected from the group consisting of Vitamins A, C, D, E, and B 
complex; and at least one mineral selected from the group consisting of 
15 calcium, magnesium, zinc, manganese, sodium, potassium, phosphorus, copper, 
chloride, iodine, selenium, and iron. 

The present invention is further directed to a method of treating a patient 
having a condition caused by insuffient intake or production of polyunsaturated 
fatty acids comprising administering to the patient a dietary substitute of the 
20 invention in an amount sufficient to effect treatment of the patient. 

The present invention is further directed to cosmetic and pharmaceutical 
compositions of the material of the invention. 

The present invention is further directed to transgenic oils in 
pharmaceutically acceptable carriers. The present invention is further directed 
25 to nutritional supplements, cosmetic agents and infant formulae containing 
transgenic oils. 

The present invention is further directed to a method for obtaining 
altered long chain polyunsaturated fatty acid biosynthesis comprising the steps 
of: growing a microbe having cells which contain a transgene which encodes a 
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transgene expression product which desaturates a fatty acid molecule at carbon 
5,5 or 12 from the carboxyl end of said fatty acid molecule, wherein the 
trangene is operably associated with an expression control sequence, under 
conditions whereby the transgene is expressed, whereby long chain 
5 polyunsaturated fatty acid biosynthesis in the cells is altered. 

The present invention is further directed toward pharmaceutical 
compositions comprising at least one nutrient selected from the group consisting 
of a vitamin, a mineral, a carbohydrate, a sugar, an amino acid, a free fatty acid, 
a phospholipid, an antioxidant, and a phenolic compound. 

10 Relevant Literature 

Production of gamma-linolenic acid by a A6-desaturase is described in 
USPN 5,552,306 and USPN 5,614,393. Production of 8, 1 1-eicosadienoic acid 
using Mortierella alpina is disclosed in USPN 5,376,541 . Production of 
docosahexaenoic acid by dinoflagellates is described in USPN 5,407,957. 

1 5 Cloning of a A6-desaturase from borage is described in PCT publication WO 
96/21022. Cloning of A9-desaturases is described in the published patent 
applications PCT WO 91/13972, EP 0 550 162 Al, EP 0 561 569 A2, EP 0 644 
263 A2, and EP 0 736 598 Al, and in USPN 5,057,419. Cloning of A12- 
desaturases from various organisms is described in PCT publication WO 

20 94/1 1516 and USPN 5,443,974. Cloning of A15-desaturases from various 

organisms is described in PCT publication WO 93/1 1245. A A6 palmitoyl-acyl 
carrier protein desaturase from Thumbergia alata and its expression in E. coli is 
described in USPN 5,614,400. Expression of a soybean stearyl-ACP desaturase 
in transgenic soybean embryos using a 35S promoter is disclosed in USPN 

25 5,443,974. 

SUMMARY OF THE INVENTION 

Novel compositions and methods are provided for preparation of poly- 
unsaturated long chain fatty acids and desaturases in plants and plant cells. The 
methods involve growing a host plant cell of interest transformed with an 
30 expression cassette functional in a host plant cell, the expression cassette 
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comprising a transcriptional and translational initiation regulatory region, joined 
in reading frame 5' to a DNA sequence encoding a desaturase polypeptide 
capable of modulating the production of PUFAs. Expression of the desaturase 
polypeptide provides for an alteration in the PUFA profile of host plant cells as 

5 a result of altered concentrations of enzymes involved in PUFA biosynthesis. 
Of particular interest is the selective control of PUFA production in plant tissues 
and/or plant parts such as leaves, roots, fruits and seeds. The invention finds 
use for example in the large scale production of DHA, EPA, ARA, and GLA 
and for modification of the fatty acid profile of edible plant tissues and/or plant 

10 parts. 

The present invention further includes a purified nucleotide sequence or 
polypeptide sequence that is substantially related or homologous to the 
nucleotide and peptide sequences presented in SEQ ID NO:l - SEQ ID NO:52. 
The present invention is further directed to methods of using the sequences 
15 presented in SEQ ID NO:l to SEQ ID NO:40 as probes to identify related 

sequences, as components of expression systems and as components of systems 
useful for producing transgenic oil. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows possible pathways for the synthesis of arachidonic acid 
20 (20:4 A5, 8, 1 1, 14) and stearidonic acid (18:4 A6, 9, 12, 15) from palmitic acid 
(Ci6) from a variety of organisms, including algae, Mortierella and humans. 
These PUFAs can serve as precursors to other molecules important for humans 
and other animals, including prostacyclins, leukotrienes, and prostaglandins, 
some of which are shown. 

25 Figure 2 shows possible pathways for production of PUFAs in addition 

to ARA, including EPA and DHA, again compiled from a variety of organisms. 

Figure 3A-E shows the DNA sequence (SEQ ID NO :l) of the 
Mortierella alpina A6 desaturase and the deduced amino acid sequence (SEQ 
IDNO:2). 



-6- 



WO 98/46764 



PCT/US98/07421 



Figure 4 shows an alignment of the Mortierella alpina A6 desaturase 
amino acid sequence with other A6 desaturases and related sequences (SEQ ID 
NOS:7, 8, 9,10,11, 12 and 13). 

Figure 5A-D shows the DNA sequence of the Mortierella alpina A12 
5 desaturase (SEQ ID NO:3) and the deduced amino acid sequence (SEQ ID 
NO:4) 

Figure 6 shows the deduced amino acid sequence (SEQ ID NO: 14) of 
the PCR fragment (see Example 1). 

Figure 7A-D shows the DNA sequence of the Mortierella alpina A5 
1 0 desaturase (SEQ ID NO:5). 

Figure 8 shows alignments of the protein sequence of the A5 desaturase 
(SEQ ID NO:6) with A6 desaturases and related sequences (SEQ ID NOS:15, 
16, 17, 18). 

Figure 9 shows alignments of the protein sequence of the Ma 29 and 
15 contig 253538a. 

Figure 10 shows alignments of the protein sequence of Ma 524 and 
contig 253538a. 

BRIEF DESCRIPTION O F THE SEQUENCE LISTINGS 
SEQ ID NO:l shows the DNA sequence of the Mortierella alpina A6 
20 desaturase. 

SEQ ID NO:2 shows the amino acid sequence of the Mortierella alpina 
A6 desaturase. 

SEQ ID NO:3 shows the DNA sequence of the Mortierella alpina A12 
desaturase. 

25 SEQ ID NO:4 shows the amino acid sequence of the Mortierella alpina 

A12 desaturase. 
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SEQ ID NO:5 shows the DNA sequence of the Mortierella alpina A5 
desaturase. 

SEQ ID NO:6 shows the amino acid sequence Mortierella alpina A5 
desaturase. 

5 SEQ ID NO:7 - SEQ ID NO: 1 3 show amino acid sequences that relate 

to Mortierella alpina A6 desaturase. 

SEQ ID NO: 14 shows an amino acid sequence of a PCR fragment of 
Example 1. 

SEQ ID NO: 15 - SEQ ID NO: 18 show amino acid sequences that relate 
1 0 to Mortierella alpina A5 and A6 desaturases. 

SEQ ID NO: 19 - SEQ ID NO:30 show PCR primer sequences. 

SEQ ID NO:31 - SEQ ID NO:37 show human nucleotide sequences. 

SEQ ID NO:38 - SEQ ID NO:44 show human peptide sequences. 

SEQ ID NO:45 - SEQ ID NO:46 show the nucleotide and amino acid 
1 5 sequence of a Dictyostelium discoideium desaturase. 

SEQ ID NO:47 - SEQ ID NO:50 show the nucleotide and deduced 
amino acid sequence of a Schizochytrium cDNA clone. 

DESCRIPTION OF THE PREFERRED EMBODIMENTS 

In order to ensure a complete understanding of the invention, the 
20 following definitions are provided: 

A5-Desaturase: A5 desaturase is an enzyme which introduces a double 
bond between carbons 5 and 6 from the carboxyl end of a fatty acid molecule. 

A6-Desaturase: A6-desaturase is an enzyme which introduces a double 
bond between carbons 6 and 7 from the carboxyl end of a fatty acid molecule. 

25 A9-Desaturase: A9-desaturase is an enzyme which introduces a double 

bond between carbons 9 and 10 from the carboxyl end of a fatty acid molecule. 
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A12-Desaturase: A12-desaturase is an enzyme which introduces a 
double bond between carbons 12 and 13 from the carboxyl end of a fatty acid 
molecule. 

Fatty Acids: Fatty acids are a class of compounds containing a long 
5 hydrocarbon chain and a terminal carboxylate group. Fatty acids include the 
following: 



Fatty Acid 


12:0 


lauric acid 




16:0 


palmitic acid 




16:1 


palmitoleic acid 




18:0 


stearic acid 




18:1 


oleic acid 


A9-18:l 


18:2 A5,9 


taxoieic acid 


A5,9-18:2 


18:2 A6,9 


6,9-octadecadienoic acid 


A6,9-18:2 


18:2 


linoleic acid 


A9,12-18:2 (LA) 


18:3 A6,9,12 


gamma-linolenic acid 


A6,9,12-18:3 (GLA) 


18:3 A5,9,12 


pinolenic acid 


A5,9,12-18:3 


18:3 


alpha-linolenic acid 


A9, 12, 15- 18:3 (ALA) 


18:4 


stearidonic acid 


A6,9,12,15-18:4 (SDA) 


20:0 


Arachidic acid 




20:1 


Eicoscenic Acid 




22:0 


behehic acid 




22:1 


erucic acid 




22:2 


Docasadienoic acid 




20:4 0)6 


arachidonic acid 


A5,8,l 1,14-20:4 (ARA) 


20:3 o>6 


o)6-eicosatrienoic 
dihomo-gamma linolenic 


A8,l 1,14-20:3 (DGLA) 


20:5 0)3 


Eicosapentanoic 
(Timnodonic acid) 


A5,8,l 1,14,17-20:5 (EPA) 


20:3 0)3 


o)3-eicosatrienoic 


Al 1,16,17-20:3 


20:4 0)3 


o)3-eicosatetraenoic 


A8, 11,14,17-20:4 


22:5 0)3 


Docosapentaenoic 


A7,10,13,16,19-22:5 (w3DPA) 


22:6 o)3 


Docosahexaenoic 
(cervonic acid) 


A4,7, 10, 13, 16,1 9-22:6 (DHA) 


24:0 


Lignoceric acid 
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Taking into account these definitions, the present invention is directed to novel 
DNA sequences, DNA constructs, methods and compositions are provided 
which permit modification of the poly-unsaturated long chain fatty acid content 
of plant cells. Plant cells are transformed with an expression cassette 
5 comprising a DNA encoding a polypeptide capable of increasing the amount of 
one or more PUFA in a plant cell. Desirably, integration constructs may be 
prepared which provide for integration of the expression cassette into the 
genome of a host cell. Host cells are manipulated to express a sense or 
antisense DNA encoding a polypeptide(s) that has desaturase activity. By 

10 "desaturase" is intended a polypeptide which can desaturate one or more fatty 
acids to produce a mono- or poly-unsaturated fatty acid or precursor thereof of 
interest. By "polypeptide" is meant any chain of amino acids, regardless of 
length or post-translational modification, for example, glycosylation or 
phosphorylation. The substrate(s) for the expressed enzyme may be produced 

15 by the host cell or may be exogenously supplied. 

To achieve expression in a host cell, the transformed DNA is operably 
associated with transcriptional and translational initiation and termination 
regulatory regions that are functional in the host cell. Constructs comprising the 
gene to be expressed can provide for integration into the genome of the host cell 

20 or can autonomously replicate in the host cell. For production of linoleic acid 
(LA), the expression cassettes generally used include a cassette which provides 
for A12 desaturase activity, particularly in a host cell which produces or can 
take up oleic acid. For production of ALA, the expression cassettes generally 
used include a cassette which provides for A15 or co3 desaturase activity, 

25 particularly in a host cell which produces or can take up LA. For production of 
GLA or SDA, the expression cassettes generally used include a cassette which 
provides for A6 desaturase activity, particularly in a host cell which produces or 
can take up LA or ALA, respectively. Production of co6-type unsaturated fatty 
acids, such as LA or GLA, is favored in a plant capable of producing ALA by 

30 inhibiting the activity of a Al 5 or <o3 type desaturase; this is accomplished by 
providing an expression cassette for an antisense A15 or co3 transcript, or by 
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disrupting a A15 or o>3 desaturase gene. Similarly, production of LA or ALA is 
favored in a plant having A6 desaturase activity by providing an expression 
cassette for an antisense A6 transcript, or by disrupting a A6 desaturase gene. 
Production of oleic acid likewise is favored in a plant having A12 desaturase 
5 activity by providing an expression cassette for an antisense A12 transcript, or 
by disrupting a A12 desaturase gene. For production of ARA, the expression 
cassette generally used provides for A5 desaturase activity, particularly in a host 
cell which produces or can take up DGLA. Production of a>6-type unsaturated 
fatty acids, such as ARA, is favored in a plant capable of producing ALA by 
10 inhibiting the activity of a A15 or a>3 type desaturase; this is accomplished by 
providing an expression cassette for an antisense A15 or ©3 transcript, or by 
disrupting a Al 5 or co3 desaturase gene. 

TRANSGENIC PLANT PRODUCTION OF FATTY ACIDS 

Transgenic plant production of PUFAs offers several advantages over 

1 5 purification from natural sources such as fish or plants. Production of fatty 
acids from recombinant plants provides the ability to alter the naturally 
occurring plant fatty acid profile by providing new synthetic pathways in the 
host or by suppressing undesired pathways, thereby increasing levels of desired 
PUFAs, or conjugated forms thereof, and decreasing levels of undesired 

20 PUFAs. Production of fatty acids in transgenic plants also offers the advantage 
that expression of desaturase genes in particular tissues and/or plant parts means 
that greatly increased levels of desired PUFAs in those tissues and/or parts can 
be achieved, making recovery from those tissues more economical. For 
example, the desired PUFAs can be expressed in seed; methods of isolating 

25 seed oils are well established. In addition to providing a source for purification 
of desired PUFAs, seed oil components can be manipulated through expression 
of desaturase genes, either alone or in combination with other genes such as 
elongases, to provide seed oils having a particular PUFA profile in concentrated 
form. The concentrated seed oils then can be added to animal milks and/or 

30 synthetic or semi-synthetic milks to serve as infant formulas where human 
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nursing is impossible or undesired, or in cases of malnourishment or disease in 
both adults and infants. 

For production of PUFAs, depending upon the host cell, the availability 
of substrate, and the desired end product(s), several polypeptides, particularly 
5 desaturases, are of interest including those polypeptides which catalyze the 
conversion of stearic acid to oleic acid, LA to GLA, of ALA to SDA, of oleic 
acid to LA, or of LA to ALA, which includes enzymes which desaturate at the 
A6, A9, A12, A15 or co3 positions. Considerations for choosing a specific 
polypeptide having desaturase activity include the pH optimum of the 
10 polypeptide, whether the polypeptide is a rate limiting enzyme or a component 
thereof, whether the desaturase used is essential for synthesis of a desired poly- 
unsaturated fatty acid, and/or co-factors required by the polypeptide. The 
expressed polypeptide preferably has parameters compatible with the 
biochemical environment of its location in the host cell. For example, the 
1 5 polypeptide may have to compete for substrate with other enzymes in the host 
cell. Analyses of the K m and specific activity of the polypeptide in question 
therefore are considered in determining the suitability of a given polypeptide for 
- modifying PUFA production in a given host cell. The polypeptide used in a 
particular situation therefore is one which can function under the conditions, 
20 present in the intended host cell but otherwise can be any polypeptide having 
desaturase activity which has the desired characteristic of being capable of 
modifying the relative production of a desired PUFA. A scheme for the 
synthesis of arachidonic acid (20:4 A5, 8, 11, 14) from palmitic acid (Cie) is 
shown in Figure 1 . A key enzyme in this pathway is a A5-desaturase which 
25 converts DH-y-Hnolenic acid (DGLA, eicosatrienoic acid) to ARA. Conversion 
of a-linolenic acid (ALA) to stearidonic acid by a A6-desaturase is also shown. 
Production of PUFAs in addition to ARA, including EPA and DHA is shown in 
Figure 2. A key enzyme in the synthesis of arachidonic acid (20:4 A5, 8, 1 1, 
14) from stearic acid (Cjg) is a A6-desaturase which converts the linoleic acid 
30 into y-linolenic acid. Conversion of a-linolenic acid (ALA) to stearidonic acid 
by a A6-desaturase also is shown. For production of ARA, the DNA sequence 
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used encodes a polypeptide having A5 desaturase activity. In particular 
instances, this can be coupled with an expression cassette which provides for 
production of a polypeptide having A6 desaturase activity and, optionally, a 
transcription cassette providing for production of antisense sequences to a A 15 
transcription product. The choice of combination of cassettes used depends in 
part on the PUFA profile of the host cell. Where the host cell A5-desaturase 
activity is limiting, overexpression of A5 desaturase alone generally will be 
sufficient to provide for enhanced ARA production. 

SOURCES OF POLYPEPTIDES 
HAVING DESATURASE ACTIVITY 

As sources of polypeptides having desaturase activity and 
oligonucleotides encoding such polypeptides are organisms which produce a 
desired poly-unsaturated fatty acid. As an example, microorganisms haying an 
ability to produce ARA can be used as a source of A5-desaturase genes; 
microorganisms which GLA or SDA can be used as a source of A6-desaturase 
and/or A12-desaturase genes. Such microorganisms include, for example, those 
belonging to the genera Mortierella, Conidiobolus, Pythium, Phytophathora, 
Penicillium, Porphyridium, Coidosporium, Mucor, Fusarium, Aspergillus, 
Rhodotorula, and Entomophthora. Within the genus Porphyridium, of 
particular interest is Porphyridium cruentum. Within the genus Mortierella, of 
particular interest are Mortierella elongata, Mortierella exigua, Mortierella 
hygrophila, Mortierella ramanniana, var. angulispora, and Mortierella alpina. 
Within the genus Mucor, of particular interest are Mucor circinelloides and 
Mucor javanicus. 

DNAs encoding desired desaturases can be identified in a variety of 
ways. As an example, a source of the desired desaturase, for example genomic 
or cDNA libraries from Mortierella, is screened with detectable enzymatically- 
or chemically-synthesized probes, which can be made from DNA, RNA, or non- 
naturally occurring nucleotides, or mixtures thereof. Probes may be 
enzymatically synthesized from DNAs of known desaturases for normal or 
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reduced-stringency hybridization methods. Oligonucleotide probes also can be 
used to screen sources and can be based on sequences of known desaturases, 
including sequences conserved among known desaturases, or on peptide 
sequences obtained from the desired purified protein. Oligonucleotide probes 

5 based on amino acid sequences can be degenerate to encompass the degeneracy 
of the genetic code, or can be biased in favor of the preferred codons of the 
source organism. Oligonucleotides also can be used as primers for PCR from 
reverse transcribed mRN A from a known or suspected source; the PCR product 
can be the full length cDNA or can be used to generate a probe to obtain the 

10 desired full length cDNA. Alternatively, a desired protein can be entirely 

sequenced and total synthesis of a DNA encoding that polypeptide performed. 

Once the desired genomic or cDN A has been isolated, it can be 
sequenced by known methods. It is recognized in the art that such methods are 
subject to errors, such that multiple sequencing of the same region is routine and 

1 5 is still expected to lead to measurable rates of mistakes in the resulting deduced 
sequence, particularly in regions having repeated domains, extensive secondary 
structure, or unusual base compositions, such as regions with high GC base 
content. When discrepancies arise, resequencing can be done and can employ 
special methods. Special methods can include altering sequencing conditions 

20 by using: different temperatures; different enzymes; proteins which alter the 
ability of oligonucleotides to form higher order structures; altered nucleotides 
such as ITP or methylated dGTP; different gel compositions, for example 
adding formamide; different primers or primers located at different distances 
from the problem region; or different templates such as single stranded DNAs. 

25 Sequencing of mRNA can also be employed. 

For the most part, some or all of the coding sequence for the polypeptide 
having desaturase activity is from a natural source. In some situations, 
however, it is desirable to modify all or a portion of the codons, for example, to 
enhance expression, by employing host preferred codons. Host preferred 
30 codons can be determined from the codons of highest frequency in the proteins 
expressed in the largest amount in a particular host species of interest. Thus, the 
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coding sequence for a polypeptide having desaturase activity can be 
synthesized in whole or in part. All or portions of the DNA also can be 
synthesized to remove any destabilizing sequences or regions of secondary 
structure which would be present in the transcribed mRNA. All or portions of 
the DNA also can be synthesized to alter the base composition to one more 
preferable in the desired host cell. Methods for synthesizing sequences and 
bringing sequences together are well established in the literature. In vitro 
mutagenesis and selection, site-directed mutagenesis, or other means can be 
employed to obtain mutations of naturally occurring desaturase genes to 
produce a polypeptide having desaturase activity in vivo with more desirable 
physical and kinetic parameters for function in the host cell, such as a longer 
half-life or a higher rate of production of a desired polyunsaturated fatty acid. 

Desirable cDNAs have less than 60% A+T composition, preferably less 
than 50% A+T composition. On a localized scale of a sliding window of 20 
base pairs, it is preferable that there are no localized regions of the cDNA with 
greater than 75% A+T composition; with a window of 60 base pairs, it is 
preferable that there are no localized regions of the cDNA with greater than 
60%, more preferably no localized regions with greater than 55% A+T 
composition. 

Mortierella alping Desaturases 
Of particular interest are the Mortierella alpina A5 -desaturase, A6- 
desaturase and A12-desaturase. The A5-desaturase has 446 amino acids; the 
amino acid sequence is shown in Figure 7. The gene encoding the Mortierella 
alpina A5-desaturase can be expressed in transgenic microorganisms to effect 
greater synthesis of ARA from DGLA. Other DNAs which are substantially 
identical in sequence to the Mortierella alpina A5-desaturase DNA, or which 
encode polypeptides which are substantially identical in sequence to the 
Mortierella alpina A5-desaturase polypeptide, also can be used. The 
Mortierella alpina A6-desaturase, has 457 amino acids and a predicted 
molecular weight of 51.8 kD; the amino acid sequence is shown in Figure 3. 
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The gene encoding the Mortierella alpina A6-desaturase can be expressed in 
transgenic plants or animals to effect greater synthesis of GLA from linoleic 
acid or of stearidonic acid (SD A) from ALA. Other DN As which are 
substantially identical in sequence to the Mortierella alpina A6-desaturase 
5 DNA, or which encode polypeptides which are substantially identical in 

sequence to the Mortierella alpina A6-desaturase polypeptide, also can be used. 

The Mortierella alpina A12-desaturase has the amino acid sequence 
shown in Figure 5. The gene encoding the Mortierella alpina A12-desaturase 
can be expressed in transgenic plants to effect greater synthesis of LA from 
1 0 oleic acid. Other DNAs which are substantially identical to the Mortierella 
alpina A12-desaturase DNA, or which encode polypeptides which are 
substantially identical to the Mortierella alpina A12-desaturase polypeptide, 
also can be used. 

By substantially identical in sequence is intended an amino acid 

1 5 sequence or nucleic acid sequence exhibiting in order of increasing preference 
at least 60%, 80%, 90% or 95% homology to the Mortierella alpina A5- 
desaturase amino acid sequence or nucleic acid sequence encoding the amino 
acid sequence. For polypeptides, the length of comparison sequences generally 
is at least 16 amino acids, preferably at least 20 amino acids, or most preferably 

20 35 amino acids. For nucleic acids, the length of comparison sequences 

generally is at least 50 nucleotides, preferably at least 60 nucleotides, and more 
preferably at least 75 nucleotides, and most preferably, 110 nucleotides. 
Homology typically is measured using sequence analysis software, for example, 
the Sequence Analysis software package of the Genetics Computer Group, 

25 University of Wisconsin Biotechnology Center, 1710 University Avenue, 
Madison, Wisconsin 53705, MEGAlign (DNAStar, Inc., 1228 S. Park St., 
Madison, Wisconsin 53715), and MacVector (Oxford Molecular Group, 2105 S. 
Bascom Avenue, Suite 200, Campbell, California 95008). Such software 
matches similar sequences by assigning degrees of homology to various 

30 substitutions, deletions, and other modifications. Conservative substitutions 

typically include substitutions within the following groups: glycine and alanine; 
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valine, isoleucine and leucine; aspartic acid, glutamic acid, asparagine, and 
glutamine; serine and threonine; lysine and arginine; and phenylalanine and 
tyrosine. Substitutions may also be made on the basis of conserved 
hydrophobicity or hydrophilicity (Kyte and Doolittle, J. Mol. Biol. 157: 105- 
5 1 32, 1 982), or on the basis of the ability to assume similar polypeptide 
secondary structure (Chou and Fasman, Adv. Enzymol. 47: 45-148, 1978). 

Other Desaturases 

Encompassed by the present invention are related desaturases from the 
same or other organisms. Such related desaturases include variants of the 

1 0 disclosed A5-, A6- and Al 2-desaturases that occur naturally within the same or 
different species of Mortierella, as well as homologues of the disclosed A5- 
desaturase from other species and evolutionarily related protein having 
desaturase activity. Also included are desaturases which, although not 
substantially identical to the Mortierella alpina A5-desaturase, desaturate a fatty 

1 5 acid molecule at carbon 5, 6 or 1 2, respectively, from the carboxyl end of a fatty 
acid molecule. Related desaturases can be identified by their ability to function 
substantially the same as the disclosed desaturases; that is, are still able to 
effectively convert DGLA to ARA, LA to GLA, ALA to SDA or oleic acid to 
LA. Related desaturases also can be identified by screening sequence databases 

20 for sequences homologous to the disclosed desaturase, by hybridization of a 

probe based on the disclosed desaturase to a library constructed from the source 
organism, or by RT-PCR using mRNA from the source organism and primers 
based on the disclosed desaturase. Such desaturases includes those from 
humans, Dictyostelium discoideum and Phaeodactylum tricornum. 

The regions of a desaturase polypeptide important for desaturase activity 
can be determined through routine mutagenesis, expression of the resulting 
mutant polypeptides and determination of their activities. Mutants may include 
deletions, insertions and point mutations, or combinations thereof. A typical 
functional analysis begins with deletion mutagenesis to determine the N- and C- 
terminal limits of the protein necessary for function, and then internal deletions, 



25 



30 
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insertions or point mutants are made to further determine regions necessary for 
function. Other techniques such as cassette mutagenesis or total synthesis also 
can be used. Deletion mutagenesis is accomplished, for example, by using 
exonucleases to sequentially remove the 5* or 3' coding regions. Kits are 

5 available for such techniques. After deletion, the coding region is completed by 
ligating oligonucleotides containing start or stop codons to the deleted coding 
region after 5' or 3' deletion, respectively. Alternatively, oligonucleotides 
encoding start or stop codons are inserted into the coding region by a variety of 
methods including site-directed mutagenesis, mutagenic PCR or by ligation 

10 onto DNA digested at existing restriction sites. Internal deletions can similarly 
be made through a variety of methods including the use of existing restriction 
sites in the DNA, by use of mutagenic primers via site directed mutagenesis or 
mutagenic PCR. Insertions are made through methods such as linker-scanning 
mutagenesis, site-directed mutagenesis or mutagenic PCR. Point mutations are 

1 5 made through techniques such as site-directed mutagenesis or mutagenic PCR. 

Chemical mutagenesis can also be used for identifying regions of a 
desaturase polypeptide important for activity. A mutated construct is expressed, 
and the ability of the resulting altered protein to function as a desaturase is 
assayed. Such structure-function analysis can determine which regions may be 
20 deleted, which regions tolerate insertions, and which point mutations allow the 
mutant protein to function in substantially the same way as the native 
desaturase. All such mutant proteins and nucleotide sequences encoding them 
are within the scope of the present invention. 

EXPRESSION OF DESATURASE GENES 
25 Once the DNA encoding a desaturase polypeptide has been obtained, it 

is placed in a vector capable of replication in a host cell, or is propagated in 
vitro by means of techniques such as PCR or long PCR. Replicating vectors 
can include plasmids, phage, viruses, cosmids and the like. Desirable vectors 
include those useful for mutagenesis of the gene of interest or for expression of 
30 the gene of interest in host cells. The technique of long PCR has made in vitro 

propagation of large constructs possible, so that modifications to the gene of 
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interest, such as mutagenesis or addition of expression signals, and propagation 
of the resulting constructs can occur entirely in vitro without the use of a 
replicating vector or a host cell. 

For expression of a desaturase polypeptide, functional transcriptional 
and translational initiation and termination regions are operably linked to the 
DNA encoding the desaturase polypeptide. Transcriptional and translational 
initiation and termination regions are derived from a variety of nonexclusive 
sources, including the DNA to be expressed, genes known or suspected to be 
capable of expression in the desired system, expression vectors, chemical 
synthesis, or from an endogenous locus in a host cell. Expression in a plant 
tissue and/or plant part presents certain efficiencies, particularly where the 
tissue or part is one which is easily harvested, such as seed, leaves, fruits, 
flowers, roots, etc. Expression can be targeted to that location within the plant 
by using specific regulatory sequences, such as those of USPN 5,463,174, 
USPN 4,943,674, USPN 5,106,739, USPN 5,175,095, USPN 5,420,034, USPN 
5,188,958, and USPN 5,589,379. Alternatively, the expressed protein can be an 
enzyme which produces a product which may be incorporated, either directly or 
upon further modifications, into a fluid fraction from the host plant. In the 
present case, expression of desaturase genes, or antisense desaturase transcripts, 
can alter the levels of specific PUFAs, or derivatives thereof, found in plant 
parts and/or plant tissues. The A5-desaturase polypeptide coding region is 
expressed either by itself or with other genes, in order to produce tissues and/or 
plant parts containing higher proportions of desired PUFAs or in which the 
PUFA composition more closely resembles that of human breast milk (Prieto et 
al, PCT publication WO 95/24494). The termination region can be derived 
from the 3' region of the gene from which the initiation region was obtained or 
from a different gene. A large number of termination regions are known to and 
have been found to be satisfactory in a variety of hosts from the same and 
different genera and species. The termination region usually is selected more as 
a matter of convenience rather than because of any particular property. 
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The choice of a host cell is influenced in part by the desired PUFA 
profile of the transgenic cell, and the native profile of the host cell. As an 
example, for production of linoleic acid from oleic acid, the DNA sequence 
used encodes a polypeptide having A12 desaturase activity, and for production 

5 of GLA from linoleic acid, the DNA sequence used encodes a polypeptide 
having A6 desaturase activity. Use of a host cell which expresses A12 
desaturase activity and lacks or is depleted in Al 5 desaturase activity, can be 
used with an expression cassette which provides for overexpression of A6 
desaturase alone generally is sufficient to provide for enhanced GLA production 

10 in the transgenic cell. Where the host cell expresses A9 desaturase activity, 
expression of both a A 12- and a A6-desaturase can provide for enhanced GLA 
production. In particular instances where expression of A6 desaturase activity is 
coupled with expression of A12 desaturase activity, it is desirable that the host 
cell naturally have, or be mutated to have, low A15 desaturase activity. 

1 5 Alternatively, a host cell for A6 desaturase expression may have, or be mutated 
to have, high A12 desaturase activity. 

Expression in a host cell can be accomplished in a transient or stable 
fashion. Transient expression can occur from introduced constructs which 
contain expression signals functional in the host cell, but which constructs do 

20 not replicate and rarely integrate in the host cell, or where the host cell is not 
proliferating. Transient expression also can be accomplished by inducing the 
activity of a regulatable promoter operably linked to the gene of interest, 
although such inducible systems frequently exhibit a low basal level of 
expression. Stable expression can be achieved by introduction of a construct 

25 that can integrate into the host genome or that autonomously replicates in the 
host cell. Stable expression of the gene of interest can be selected for through 
the use of a selectable marker located on or transfected with the expression 
construct, followed by selection for cells expressing the marker. When stable 
expression results from integration, integration of constructs can occur 

30 randomly within the host genome or can be targeted through the use of 

constructs containing regions of homology with the host genome sufficient to 
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target recombination with the host locus. Where constructs are targeted to an 
endogenous locus, all or some of the transcriptional and translational regulatory 
regions can be provided by the endogenous locus. 

When increased expression of the desaturase polypeptide in the source 
5 plant is desired, several methods can be employed. Additional genes encoding 
the desaturase polypeptide can be introduced into the host organism. 
Expression from the native desaturase locus also can be increased through 
homologous recombination, for example by inserting a stronger promoter into 
the host genome to cause increased expression, by removing destabilizing 
1 0 sequences from either the mRNA or the encoded protein by deleting that 
information from the host genome, or by adding stabilizing sequences to the 
mRNA (see USPN 4,910,141 and USPN 5,500,365.) 

When it is desirable to express more than one different gene, appropriate 
regulatory regions and expression methods, introduced genes can be propagated 

15 in the host cell through use of replicating vectors or by integration into the host 
genome. Where two or more genes are expressed from separate replicating 
vectors, it is desirable that each vector has a different means of replication. 
Each introduced construct, whether integrated or not, should have a different 
means of selection and should lack homology to the other constructs to maintain 

20 stable expression and prevent reassortment of elements among constructs. 
Judicious choices of regulatory regions, selection means and method of 
propagation of the introduced construct can be experimentally determined so 
that all introduced genes are expressed at the necessary levels to provide for 
synthesis of the desired products. 

25 Constructs comprising the gene of interest may be introduced into a host 

cell by standard techniques. These techniques include transfection, infection, 
holistic impact, electroporation, microinjection, scraping, or any other method 
which introduces the gene of interest into the host cell (see USPN 4,743,548, 
USPN 4,795,855, USPN 5,068,193, USPN 5,188,958, USPN 5,463,174, USPN 

30 5,565,346 and USPN 5,565,347). For convenience, a host cell which has been 
manipulated by any method to take up a DNA sequence or construct will be 
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referred to as "transformed" or "recombinant" herein. The subject host will 
have at least have one copy of the expression construct and may have two or 
more, depending upon whether the gene is integrated into the genome, 
amplified, or is present on an extrachromosomal element having multiple copy 
5 numbers. 

The transformed host cell can be identified by selection for a marker 
contained on the introduced construct. Alternatively, a separate marker 
construct may be introduced with the desired construct, as many transformation 
techniques introduce many DNA molecules into host cells. Typically, 
1 0 transformed hosts are selected for their ability to grow on selective media. 
Selective media may incorporate an antibiotic or lack a factor necessary for 
growth of the untransformed host, such as a nutrient or growth factor. An 
introduced marker gene therefor may confer antibiotic resistance, or encode an 
essential growth factor or enzyme, and permit growth on selective media when 
expressed in the transformed host cell. Desirably, resistance to kanamycin and 
the amino glycoside G418 are of interest (see USPN 5,034,322). Selection of a 
transformed host can also occur when the expressed marker protein can be 
detected, either directly or indirectly. The marker protein may be expressed 
alone or as a fusion to another protein. The marker protein can be detected by 
20 its enzymatic activity; for example p galactosidase can convert the substrate X- 
gal to a colored product, and luciferase can convert luciferin to a light-emitting 
product. The marker protein can be detected by its light-producing or 
modifying characteristics; for example, the green fluorescent protein of 
Aequorea victoria fluoresces when illuminated with blue light. Antibodies can 
be used to detect the marker protein or a molecular tag on, for example, a 
protein of interest. Cells expressing the marker protein or tag can be selected, 
for example, visually, or by techniques such as FACS or panning using 
antibodies. 



15 



25 



30 



The PUFAs produced using the subject methods and compositions may 
be found in the host plant tissue and/or plant part as free fatty acids or in 
conjugated forms such as acylglycerols, phospholipids, sulfolipids or 
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glycolipids, and may be extracted from the host cell through a variety of means 
well-known in the art. Such means may include extraction with organic 
solvents, sonication, supercritical fluid extraction using for example carbon 
dioxide, and physical means such as presses, or combinations thereof. Of 
particular interest is extraction with hexane or methanol and chloroform. Where 
desirable, the aqueous layer can be acidified to protonate negatively charged 
moieties and thereby increase partitioning of desired products into the organic 
layer. After extraction, the organic solvents can be removed by evaporation 
under a stream of nitrogen. When isolated in conjugated forms, the products are 
enzymatically or chemically cleaved to release the free fatty acid or a less 
complex conjugate of interest, and are then subjected to further manipulations to 
produce a desired end product. Desirably, conjugated forms of fatty acids are 
cleaved with potassium hydroxide. 

PURIFICATION OF FATTY ACIDS 

If further purification is necessary, standard methods can be employed. 
Such methods include extraction, treatment with urea, fractional crystallization, 
HPLC, fractional distillation, silica gel chromatography, high speed 
centrifugation or distillation, or combinations of these techniques. Protection of 
reactive groups, such as the acid or alkenyl groups, may be done at any step 
through known techniques, for example alkylation or iodination. Methods used 
include methylation of the fatty acids to produce methyl esters. Similarly, 
protecting groups may be removed at any step. Desirably, purification of 
fractions containing ARA, DHA and EPA is accomplished by treatment with 
urea and/or fractional distillation. 

USES OF FATTY ACIDS 

The uses of the fatty acids of subject invention are several. Probes based 
on the DNAs of the present invention may find use in methods for isolating 
related molecules or in methods to detect organisms expressing desaturases. 
When used as probes, the DNAs or oligonucleotides need to be detectable. This 
is usually accomplished by attaching a label either at an internal site, for 
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example via incorporation of a modified residue, or at the 5 5 or 3' terminus. 
Such labels can be directly detectable, can bind to a secondary molecule that is 
detectably labeled, or can bind to an unlabelled secondary molecule and a 
detectably labeled tertiary molecule; this process can be extended as long as is 

5 practical to achieve a satisfactorily detectable signal without unacceptable levels 
of background signal. Secondary, tertiary, or bridging systems can include use 
of antibodies directed against any other molecule, including labels or other 
antibodies, or can involve any molecules which bind to each other, for example 
a biotin-streptavidin/avidin system. Detectable labels typically include 

10 radioactive isotopes, molecules which chemically or enzymatically produce or 
alter light, enzymes which produce detectable reaction products, magnetic 
molecules, fluorescent molecules or molecules whose fluorescence or light- 
emitting characteristics change upon binding. Examples of labelling methods 
can be found in USPN 5,01 1,770. Alternatively, the binding of target molecules 

1 5 can be directly detected by measuring the change in heat of solution on binding 
of probe to target via isothermal titration calorimetry, or by coating the probe or 
target on a surface and detecting the change in scattering of light from the 
surface produced by binding of target or probe, respectively, as may be done 
with the BIAcore system. 

20 PUFAs of the subject invention produced by recombinant means find 

applications in a wide variety of areas. Supplementation of humans or animals 
with PUFAs in various forms can result in increased levels not only of the 
added PUFAs, but of their metabolic progeny as well. For example, where the 
inherent A6-desaturase pathway is dysfunctional in an individual, treatment with 

25 GLA can result not only in increased levels of GLA, but also of downstream 
products such as ARA and prostaglandins (see Figure 1). Complex regulatory 
mechanisms can make it desirable to combine various PUFAs, or to add 
different conjugates of PUFAs, in order to prevent, control or overcome such 
mechanisms to achieve the desired levels of specific PUFAs in an individual. 

30 PUFAs, or derivatives thereof, made by the disclosed method can be 

used as dietary supplements, particularly in infant formulas, for patients 
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undergoing intravenous feeding or for preventing or treating malnutrition. 
Particular fatty acids such as EPA are used to alter the composition of infant 
formulas to better replicate the PUFA composition of human breast milk. The 
predominant triglyceride in human milk has been reported to be l,3-di-oleoyl-2- 
5 palmitoyl, with 2-palmitoyl glycerides reported as better absorbed than 2-oleoyl 
or 2-lineoyl glycerides (USPN 4,876,107). Typically, human breast milk has a 
fatty acid profile comprising from about 0. 15 % to about 0.36 % as DHA, from 
about 0.03 % to about 0. 1 3 % as EPA, from about 0.30 % to about 0.88 % as 
ARA, from about 0.22 % to about 0.67 % as DGLA, and from about 0.27 % to 

10 about 1.04 % as GLA. A preferred ratio of GLA:DGLA:ARA in infant 
formulas is from about 1 : 1 :4 to about 1:1:1, respectively. Amounts of oils 
providing these ratios of PUFA can be determined without undue 
experimentation by one of skill in the art. PUFAs, or host cells containing 
them, also can be used as animal food supplements to alter an animal's tissue or 

1 5 milk fatty acid composition to one more desirable for human or animal 
consumption. 

NUTRITIONAL COMPOSITIONS 

The present invention also includes nutritional compositions. Such 
compositions, for purposes of the present invention, include any food or 
preparation for human consumption including for enteral or parenteral 
consumption, which when taken into the body (a) serve to nourish or build up 
tissues or supply energy and/or (b) maintain, restore or support adequate 
nutritional status or metabolic function. 

Thejiutritional composition of the present invention comprises at least 
one oil or acid produced in accordance with the present invention and may 
either be in a solid or liquid form. Additionally, the composition may include 
edible macronutrients, vitamins and minerals in amounts desired for a particular 
use. The amount of such ingredients will vary depending on whether the 
composition is intended for use with normal, healthy infants, children or adults 
having specialized needs such as those which accompany certain metabolic 
conditions (e.g., metabolic disorders). 
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Examples of macronutrients which may be added to the composition 
include but are not limited to edible fats, carbohydrates and proteins. Examples 
of such edible fats include but are not limited to coconut oil, soy oil, and mono- 
and diglycerides. Examples of such carbohydrates include but are not limited to 
5 glucose, edible lactose and hydrolyzed search. Additionally, examples of 
proteins which may be utilized in the nutritional composition of the invention 
include but are not limited to soy proteins, electrodialysed whey , 
electrodialysed skim milk, milk whey, or the hydrolysates of these proteins. 

With respect to vitamins and minerals, the following may be added to 
10 the nutritional compositions of the present invention: calcium, phosphorus, 
potassium, sodium, chloride, magnesium, manganese, iron, copper, zinc, 
selenium, iodine, and Vitamins A, E, D, C, and the B complex. Other such 
vitamins and minerals may also be added. 

The components utilized in the nutritional compositions of the present 
15 invention will of semi-purified or purified origin. By semi-purified or purified 
is meant a material which has been prepared by purification of a natural 
material or by synthesis. 

Examples of nutritional compositions of the present invention include 
but are not limited to infant formulas, dietary supplements, and rehydration 
20 compositions. Nutritional compositions of particular interest include but are not 
limited to those utilized for enteral and parenteral supplementation for infants, 
specialist infant formulae, supplements for the elderly, and supplements for 
those with gastrointestinal difficulties and/or malabsorption. 

Nutritional Compositions 

25 A typical nutritional composition of the present invention will contain 

edible macronutrients, vitamins and minerals in amounts desired for a particular 
use. The amounts of such ingredients will vary depending on whether the 
formulation is intended for use with normal, healthy individuals temporarily 
exposed to stress, or to subjects having specialized needs due to certain chronic 

30 or acute disease states (e.g., metabolic disorders). It will be understood by 
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persons skilled in the art that the components utilized in a nutritional 
formulation of the present invention are of semi-purified or purified origin. By 
semi-purified or purified is meant a material that has been prepared by 
purification of a natural material or by synthesis. These techniques are well 
5 known in the art (See, e.g., Code of Federal Regulations for Food Ingredients 
and Food Processing; Recommended Dietary Allowances, 10 th Ed., National 
Academy Press, Washington, D.C., 1989). 

In a preferred embodiment, a nutritional formulation of the present 
invention is an enteral nutritional product, more preferably an adult or child 
10 enteral nutritional product. Accordingly in a further aspect of the invention, a 
nutritional formulation is provided that is suitable for feeding adults or children 
who are experiencing stress. The formula comprises, in addition to the PUFAs 
of the invention; macronutrients, vitamins and minerals in amounts designed to 
provide the daily nutritional requirements of adults. 

15 The macronutritional components include edible fats, carbohydrates and 

proteins. Exemplary edible fats are coconut oil, soy oil, and mono- and 
diglycerides and the PUFA oils of this invention. Exemplary carbohydrates are 
glucose, edible lactose and hydrolyzed cornstarch. A typical protein source 
would be soy protein, electrodialysed whey or electrodialysed skim milk or milk 

20 whey, or the hydrolysates of these proteins, although other protein sources are 
also available and may be used. These macronutrients would be added in the 
form of commonly accepted nutritional compounds in amount equivalent to 
those present in human milk or an energy basis, i.e., on a per calorie basis. 

Methods for formulating liquid and enteral nutritional formulas are well 
25 known in the art and are described in detail in the examples. 

The enteral formula can be sterilized and subsequently utilized on a 
ready-to-feed (RTF) basis or stored in a concentrated liquid or a powder. The 
powder can be prepared by spray drying the enteral formula prepared as 
indicated above, and the formula can be reconstituted by rehydrating the 
30 concentrate. Adult and infant nutritional formulas are well known in the art and 
commercially available (e.g., Similac®, Ensure®, Jevity® and Alimentum® 
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from Ross Products Division, Abbott Laboratories). An oil or acid of the 
present invention can be added to any of these formulas in the amounts 
described below. 

The energy density of the nutritional composition when in liquid form, 
5 can typically range from about 0.6 Kcal to 3 Kcal per ml. When in solid or 
powdered form, the nutritional supplement can contain from about 1.2 to more 
than 9 Kcals per gm, preferably 3 to 7 Kcals per gm. In general, the osmolality 
of a liquid product should be less than 700 mOsm and more preferably less than 
660 mOsm. 

1 0 The nutritional formula would typically include vitamins and minerals, 

in addition to the PUFAs of the invention, in order to help the individual ingest 
the minimum daily requirements for these substances. In addition to the PUFAs 
listed above, it may also be desirable to supplement the nutritional composition 
with zinc, copper, and folic acid in addition to antioxidants. It is believed that 

1 5 these substances will also provide a boost to the stressed immune system and 
thus will provide further benefits to the individual. The presence of zinc, 
copper or folic acid is optional and is not required in order to gain the beneficial 
effects on immune suppression. Likewise a pharmaceutical composition can be 
supplemented with these same substances as well. 

20 In a more preferred embodiment, the nutritional contains, in addition to 

the antioxidant system and the PUFA component, a source of carbohydrate 
wherein at least 5 weight % of said carbohydrate is an indigestible 
oligosaccharide. In yet a more preferred embodiment, the nutritional 
composition additionally contains protein, taurine and carnitine. 

25 The PUFAs, or derivatives thereof, made by the disclosed method can 

be used as dietary substitutes, or supplements, particularly infant formulas, for 
patients undergoing intravenous feeding or for preventing or treating 
malnutrition. Typically, human breast milk has a fatty acid profile comprising 
from about 0.15 % to about 0.36 % as DHA, from about 0.03 % to about 0.13 % 

30 as EPA, from about 0.30 % to about 0.88 % as ARA, from about 0.22 % to 
about 0.67 % as DGLA, and from about 0.27 % to about 1.04 % as GLA. 
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Additionally, the predominant triglyceride in human milk has been reported to 
be l,3-di-oleoyl-2-palmitoyl, with 2-palmitoyl glycerides reported as better 
absorbed than 2-oleoyl or 2-lineoyl glycerides (USPN 4,876,107). Thus, fatty 
acids such as ARA, DGLA, GL A and/or EPA produced by the invention can be 
5 used to alter the composition of infant formulas to better replicate the PUF A 
composition of human breast milk. In particular, an oil composition for use in a 
pharmacologic or food supplement, particularly a breast milk substitute or 
supplement, will preferably comprise one or more of ARA, DGLA and GLA. 
More preferably the oil will comprise from about 0.3 to 30% ARA, from about 
1 0 0.2 to 30% DGLA, and from about 0.2 to about 30% GLA. 

In addition to the concentration, the ratios of ARA, DGLA and GLA can 
be adapted for a particular given end use. When formulated as a breast milk 
supplement or substitute, an oil composition which contains two or more of 
ARA, DGLA and GLA will be provided in a ratio of about 1:19:30 to about 
6:1:0.2, respectively. For example, the breast milk of animals can vary in ratios 
of ARA:DGLA:DGL ranging from 1:19:30 to 6:1:0.2, which includes 
intermediate ratios which are preferably about 1:1:1, 1:2:1, 1:1:4. When 
produced together in a host cell, adjusting the rate and percent of conversion of 
a precursor substrate such as GLA and DGLA to ARA can be used to precisely 
20 control the PUFA ratios. For example, a 5% to 10% conversion rate of DGLA 
to ARA can be used to produce an ARA to DGLA ratio of about 1:19, whereas 
a conversion rate of about 75% to 80% can be used to produce an ARA to 
DGLA ratio of about 6:1. Therefore, whether in a cell culture system or in a 
host animal, regulating the timing, extent and specificity of desaturase 
expression as described can be used to modulate the PUFA levels and ratios. 
Depending on the expression system used, e.g., cell culture or an animal 
expressing oil(s) in its milk, the oils also can be isolated and recombined in the 
desired concentrations and ratios. Amounts of oils providing these ratios of 
PUFA can be determined following standard protocols. PUFAs, or host cells 
containing them, also can be used as animal food supplements to alter an 
animal's tissue or milk fatty acid composition to one more desirable for human 
or animal consumption. 
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For dietary supplementation, the purified PUFAs, or derivatives thereof, 
may be incorporated into cooking oils, fats or margarines formulated so that in 
normal use the recipient would receive the desired amount. The PUFAs may 
also be incorporated into infant formulas, nutritional supplements or other food 
5 products, and may find use as anti-inflammatory or cholesterol lowering agents. 



The present invention also encompasses a pharmaceutical composition 
comprising one or more of the acids and/or resulting oils produced in 
accordance with the methods described herein. More specifically, such a 

1 0 pharmaceutical composition may comprise one or more of the acids and/or oils 
as well as a standard, well-known, non-toxic pharmaceutically acceptable 
carrier, adjuvant or vehicle such as, for example, phosphate buffered saline, 
water, ethanol, polyols, vegetable oils, a wetting agent or an emulsion such as a 
water/oil emulsion. The composition may be in either a liquid or solid form. 

1 5 For example, the composition may be in the form of a tablet, capsule, ingestible 
liquid or powder, injectible, or topical ointment or cream. 

Possible routes of administration include, for example, oral, rectal and 
parenteral. The route of administration will, of course, depend upon the desired 
effect. For example, if the composition is being utilized to treat rough, dry, or 
20 aging skin, to treat injured or burned skin, or to treat skin or hair affected by a 
disease or condition, it may perhaps be applied topically. 

The dosage of the composition to be administered to the patient may be 
determined by one of ordinary skill in the art and depends upon various factors 
such as weight of the patient, age of the patient, immune status of the patient, 



With respect to form, the composition may be, for example, a solution, a 
dispersion, a suspension, an emulsion or a sterile powder which is then 
reconstituted. 



Pharmaceutical Compositions 



25 



etc. 
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Additionally, the composition of the present invention may be utilized 
for cosmetic purposes. It may be added to pre-existing cosmetic compositions 
such that a mixture is formed or may be used as a sole composition. 

Pharmaceutical compositions may be utilized to administer the PUFA 
5 component to an individual. Suitable pharmaceutical compositions may 

comprise physiologically acceptable sterile aqueous or non-aqueous solutions, 
dispersions, suspensions or emulsions and sterile powders for reconstitution into 
sterile solutions or dispersions for ingestion. Examples of suitable aqueous and 
non-aqueous carriers, diluents, solvents or vehicles include water, ethanol, 

1 0 polyols (propyleneglycol, polyethyleneglycol, glycerol, and the like), suitable 
mixtures thereof, vegetable oils (such as olive oil) and injectable organic esters 
such as ethyl oleate. Proper fluidity can be maintained, for example, by the 
maintenance of the required particle size in the case of dispersions and by the 
use of surfactants. It may also be desirable to include isotonic agents, for 

1 5 example sugars, sodium chloride and the like. Besides such inert diluents, the 
composition can also include adjuvants, such as wetting agents, emulsifying and 
suspending agents, sweetening, flavoring and perfuming agents. 

Suspensions, in addition to the active compounds, may contain 
suspending agents, as for example, ethoxylated isostearyl alcohols, 
20 polyoxyethylene sorbitol and sorbitan esters, microcrystalline cellulose, 

aluminum metahydroxide, bentonite, agar-agar and tragacanth or mixtures of 
these substances, and the like. 

Solid dosage forms such as tablets and capsules can be prepared using 
techniques well known in the art. For example, PUFAs of the invention can be 

25 tableted with conventional tablet bases such as lactose, sucrose, and cornstarch 
in combination with binders such as acacia, cornstarch or gelatin, disintegrating 
agents such as potato starch or alginic acid and a lubricant such as stearic acid 
or magnesium stearate. Capsules can be prepared by incorporating these 
excipients into a gelatin capsule along with the antioxidants and the PUFA 

30 component. The amount of the antioxidants and PUFA component that should 
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be incorporated into the pharmaceutical formulation should fit within the 
guidelines discussed above. 

As used in this application, the term "treat" refers to either preventing, or 
reducing the incidence of, the undesired occurrence. For example, to treat 

5 immune suppression refers to either preventing the occurrence of this 

suppression or reducing the amount of such suppression. The terms "patient" 
and "individual" are being used interchangeably and both refer to an animal. 
The term "animal" as used in this application refers to any warm-blooded 
mammal including, but not limited to, dogs, humans, monkeys, and apes. As 

1 0 used in the application the term "about" refers to an amount varying from the 
stated range or number by a reasonable amount depending upon the context of 
use. Any^numerical number or range specified in the specification should be 
considered to be modified by the term about. 

"Dose" and "serving" are used interchangeably and refer to the amount 
15 of the nutritional or pharmaceutical composition ingested by the patient in a 
single setting and designed to deliver effective amounts of the antioxidants and 
the structured triglyceride. As will be readily apparent to those skilled in the 
art, a single dose or serving of the liquid nutritional powder should supply the 
amount of antioxidants and PUFAs discussed above. The amount of the dose or 
20 serving should be a volume that a typical adult can consume in one sitting. This 
amount can vary widely depending upon the age, weight, sex or medical 
condition of the patient. However as a general guideline, a single serving or 
dose of a liquid nutritional produce should be considered as encompassing a 
volume from 100 to 600 ml, more preferably from 125 to 500 ml and most 
25 preferably from 125 to 300 ml. 

The PUFAs of the present invention may also be added to food even 
when supplementation of the diet is not required. For example, the composition 
may be added to food of any type including but not limited to margarines, 
modified butters, cheeses, milk, yogurt, chocolate, candy, snacks, salad oils, 
30 cooking oils, cooking fats, meats, fish and beverages. 
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Pharmaceutical Applications 

For pharmaceutical use (human or veterinary), the compositions are 
generally administered orally but can be administered by any route by which 
they may be successfully absorbed, e.g., parenterally (i.e. subcutaneously, 
5 intramuscularly or intravenously), rectally or vaginally or topically, for 

example, as a skin ointment or lotion. The PUFAs of the present invention may 
be administered alone or in combination with a pharmaceutically acceptable 
carrier or excipient. Where available, gelatin capsules are the preferred form of 
oral administration. Dietary supplementation as set forth above also can 

1 0 provide an oral route of administration. The unsaturated acids of the present 
invention may be administered in conjugated forms, or as salts, esters, amides 
or prodrugs of the fatty acids. Any pharmaceutically acceptable salt is 
encompassed by the present invention; especially preferred are the sodium, 
potassium or lithium salts. Also encompassed are the N-alkylpolyhydrdxamine 

15 salts, such as N-methyl glucamine, found in PCT publication WO 96/33155. 
The preferred esters are the ethyl esters. As solid salts, the PUFAs also can be 
administered in tablet form. For intravenous administration, the PUFAs or 
derivatives thereof may be incorporated into commercial formulations such as 
Intralipids. The typical normal adult plasma fatty acid profile comprises 6.64 to 

20 9.46% of ARA, 1.45 to 3.1 1% of DGLA, and 0.02 to 0.08% of GLA. These 
PUFAs or their metabolic precursors can be administered, either alone or in 
mixtures with other PUFAs, to achieve a normal fatty acid profile in a patient. 
Where desired, the individual components of formulations may be individually 
provided in kit form, for single or multiple use. A typical dosage of a particular 

25 fatty acid is from 0. 1 mg to 20 g, or even 1 00 g daily, and is preferably from 1 0 
mg to 1, 2, 5 or 10 g daily as required, or molar equivalent amounts of 
derivative forms thereof. Parenteral nutrition compositions comprising from 
about 2 to about 30 weight percent fatty acids calculated as triglycerides are 
encompassed by the present invention; preferred is a composition having from 

30 about 1 to about 25 weight percent of the total PUFA composition as GLA 

(USPN 5,196,198). Other vitamins, and particularly fat-soluble vitamins such 

as vitamin A, D, E and L-carnitine can optionally be included. Where desired a 
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preservative such as a tocopherol may be added, typically at about 0.1% by 
weight. 

Suitable pharmaceutical compositions may comprise physiologically 
acceptable sterile aqueous or non-aqueous solutions, dispersions, suspensions or 
5 emulsions and sterile powders for reconstitution into sterile injectible solutions 
or dispersions. Examples of suitable aqueous and non-aqeuous carriers, 
diluents, solvents or vehicles include water, ethanol, polyols (propylleneglyol, 
polyethylenegycol, glycerol, and the like), suitable mixtures thereof, vegetable 
oils (such as olive oil) and injectable organic esters such as ehyl oleate. Proper 

10 fluidity can be maintained, for example, by the maintenance of the required 

particle size in the case of dispersions and by the use of surfactants. It may also 
be desirable to include isotonic agents, for example sugars, sodium chloride and 
the like. Besides such inert diluents, the composition can also include 
adjuvants, such as wetting agents, emulsifying and suspending agents, 

1 5 sweetening, flavoring and perfuming agents. 

Suspensions in addition to the active compounds, may contain 
suspending agents, as for example, ethoxylated isostearyl alcohols, 
polyoxyethylene sorbitol and sorbitan esters, microcrystalline cellulose, 
aluminum metahydroxide, bentonite, agar-agar and tragacanth, or mixtures of 
20 these substances and the like. 

An especially preferred pharmaceutical composition contains 
diacetyltartaric acid esters of mono- and diglycerides dissolved in an aqueous 
medium or solvent. Diacetyltartaric acid esters of mono- and diglycerides have 
an HLB value of about 9-12 and are significantly more hydrophilic than existing 

25 antimicrobial lipids that have HLB values of 2-4. Those existing hydrophobic 
lipids cannot be formulated into aqueous compositions. As disclosed herein, 
those lipids can now be solubilized into aqueous media in combination with 
diacetyltartaric acid esters of mono-and diglycerides. In accordance with this 
embodiment, diacetyltartaric acid esters of mono- and diglycerides (e.g., 

30 DATEM-C12:0) is melted with other active antimicrobial lipids (e.g., 18:2 and 
12:0 monoglycerides) and mixed to obtain a homogeneous mixture. 
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Homogeneity allows for increased antimicrobial activity. The mixture can be 
completely dispersed in water. This is not possible without the addition of 
diacetyltartaric acid esters of mono- and diglycerides and premixing with other 
monoglycerides prior to introduction into water. The aqueous composition can 
5 then be admixed under sterile conditions with physiologically acceptable 

diluents, preservatives, buffers or propellants as may be required to form a spray 
or inhalant. 

The present invention also encompasses the treatment of numerous 
disorders with fatty acids. Supplementation with PUFAs of the present 

1 0 invention can be used to treat restenosis after angioplasty. Symptoms of 

inflammation, rheumatoid arthritis, and asthma and psoriasis can be treated with 
the PUFAs of the present invention. Evidence indicates that PUFAs may be 
involved in calcium metabolism, suggesting that PUFAs of the present 
invention may be used in the treatment or prevention of osteoporosis and of 

1 5 kidney or urinary tract stones. 

The PUFAs of the present invention can be used in the treatment of 
cancer. Malignant cells have been shown to have altered fatty acid 
compositions; addition of fatty acids has been shown to slow their growth and 
cause cell death, and to increase their susceptibility to chemotherapeutic agents. 

20 GLA has been shown to cause reexpression on cancer cells of the E-cadherin 
cellular adhesion molecules, loss of which is associated with aggressive 
metastasis. Clinical testing of intravenous administration of the water soluble 
lithium salt of GLA to pancreatic cancer patients produced statistically 
significant increases in their survival. PUFA supplementation may also be 

25 useful for treating cachexia associated with cancer. 

The PUFAs of the present invention can also be used to treat diabetes 
(USPN 4,826,877; Horrobin et aL 9 Am. J. Clin. Nutr. Vol. 57 (Suppl.), 732S- 
737S). Altered fatty acid metabolism and composition has been demonstrated 
in diabetic animals. These alterations have been suggested to be involved in 
30 some of the long-term complications resulting from diabetes, including 
retinopathy, neuropathy, nephropathy and reproductive system damage. 
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Primrose oil, which contains GLA, has been shown to prevent and reverse 
diabetic nerve damage. 

The PUFAs of the present invention can be used to treat eczema, reduce 
blood pressure and improve math scores. Essential fatty acid deficiency has 

5 been suggested as being involved in eczema, and studies have shown beneficial 
effects on eczema from treatment with GLA. GLA has also been shown to 
reduce increases in blood pressure associated with stress, and to improve 
performance on arithmetic tests. GLA and DGLA have been shown to inhibit 
platelet aggregation, cause vasodilation, lower cholesterol levels and inhibit 

10 proliferation of vessel wall smooth muscle and fibrous tissue (Brenner et ah , 
Adv. Exp. Med. Biol. Vol. 83, p. 85-101, 1976). Administration of GLA or 
DGLA, alone or in combination with EPA, has been shown to reduce or prevent 
gastro-intestinal bleeding and other side effects caused by non-steroidal anti- 
inflammatory drugs (USPN 4,666,701). GLA and DGLA have also been shown 

1 5 to prevent or treat endometriosis and premenstrual syndrome (USPN 4,758,592) 
and to treat myalgic encephalomyelitis and chronic fatigue after viral infections 
(USPN 5,116,871). 

Further uses of the PUFAs of this invention include use in treatment of 
AIDS, multiple schlerosis, acute respiratory syndrome, hypertension and 
20 inflammatory skin disorders. The PUFAs of the inventions also can be used for 
formulas for general health as well as for geriatric treatments. 

Veterinary Applications 

It should be noted that the above-described pharmaceutical and 
nutritional compositions may be utilized in connection with animals, as well as 
25 humans, as animals experience many of the same needs and conditions as 

human. For example, the oil or acids of the present invention may be utilized in 
animal feed supplements or as animal feed substitutes. 

The following examples are presented by way of illustration, not of 
limitation. 
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Examples 

Example 1 Isolation of A5 Desaturase Nucleotide Sequence from 
Mortierella alpina 

Example 2 Isolation of A6 Desaturase Nucleotide Sequence from 
Mortierella alpina 

Example 3 Identification of A6 Desaturases Homologues to the 
Mortierella alpina A Desaturase 

Example 4 Isolation of D- 1 2 Desaturase Nucleotide Sequence from 
Mortierella alpina 

Example 5 Isolation of Cytochrome b5 Reductase Nucleotide 
Sequence from Mortierella alpina 

Example 6 Expression of Af. alpina Desaturase Clones in Baker's 
Yeast 

Example 7 Fatty Acid Analysis of Leaves from Ma29 Transgenic 
Brassica Plants 

Example 8 Expression of M. alpina A6 Desaturase in Brassica 
napus 



20 



25 



Example 9 Expression of M. alpina A 12 desaturase in Brassica 
napus 

Example 10 Simultaneous expression of M. alpina A6 and A 12 
desaturases in Brassica napus 

Example 1 1 Simultaneous expression of M alpina A5 and A6 
desaturases in Brassica napus 

Example 12 Simultaneous expression of M alpina A5, A6 and A 12 
desaturases in Brassica napus 

Example 1 3 Stereospecific Distribution of A6-Desaturated Oils 

Example 14 Fatty Acid Compositions of Transgenic Plants 
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Example 15 Combined Expression of A6 and A 12 Desaturases in B. 
napus Achieved by Crossing 

Example 16 Expression of M alpina desaturases in soybean 

Example 1 7 Human Desaturase Gene Sequences 

5 Example 1 

Isolation of a A5-desaturase Nucleotide Sequence from Mortierella alpina 
Motierella alpina produces arachidonic acid (ARA, 20:4) from the 
precursor 20:3 by a A5-desaturase. A nucleotide sequence encoding the A5- 
desaturase from Mortierella alpina (see Figure 7) was obtained through PCR 
10 amplification using M alpina 1 st strand cDNA and degenerate oligonucleotide 
primers corresponding to amino acid sequences conserved between A6- 
desaturases from Synechocystis and Spirulina* The procedure used was as 
follows: 

Total RNA was isolated from a 3 day old PUFA-producing culture of 
1 5 Mortierella alpina using the protocol of Hoge et al (1982) Experimental 

Mycology 6:225-232. The RNA was used to prepare double-stranded cDNA 
using BRL's lambda-ZipLox system, following the manufacturer's instructions. 
Several size fractions of the M. alpina cDNA were packaged separately to yield 
libraries with different average-sized inserts. The "full-length" library contains 
20 approximately 3 x 10 6 clones with an average insert size of 1.77 kb. The 
"sequencing-grade" library contains approximately 6 x 10 5 clones with an 
average insert size of 1.1 kb. 

5|xg of total RNA was reverse transcribed using BRL Superscript RTase 
and the primer TSyn S'-CAAGCTTCTGCAGGAGCTCTTTTTTTTTTTTTTT- 
25 3' (SEQ ID NO: 19.) Degenerate oligonucleotides were designed to regions 
conserved between the two cyanobacterial A6-desaturase sequences. The 
specific primers used were: 
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D6DESAT-F3 (SEQ ID NO:20) 

5'-CUACUACUACUACAYCAYACOTAYACOAAYAT-3' 
D6DESAT-R3 ( SEQ ID NO:21) 

5 ' -C AUC AUC AUC AUOGGRAAO ARRTGRTG-3 ' 

5 where Y=C+T, R=A+G, and 0=I+C. PCR amplification was carried out in a 
25ul volume containing: template derived from 40 ng total RNA, 2 pM each 
primer, 200 uM each deoxyribonucleotide triphosphate, 60 mM Tris-Cl, pH 8.5, 
15 mM (NH4) 2 S0 4 , 2 mM MgCl 2 . Samples were subjected to an initial 
desaturation step of 95 degrees (all temperatures Celsius) for 5 minutes, then 

1 0 held at 72 degrees while 0.2 U of Taq polymerase were added. PCR 

thermocycling conditions were as follows: 94 degrees for 1 min., 45 degrees 
for 1.5 min., 72 degrees for 2 min. PCR was continued for 35 cycles. PCR 
using these primers on the M. alpina first-strand cDNA produced a 550 bp 
reaction product. Comparison of the deduced amino acid sequence of the M. 

1 5 alpina PCR fragment revealed regions of homology with A6-desaturases (see 
Figure 4). However, there was only about 28% identity over the region 
compared. The deduced amino acid sequence is presented in SEQ ID NO: 14. 

The PCR product was used as a probe to isolate corresponding cDNA 
clones from a M. alpina library. The longest cDNA clone, Ma29, was 

20 designated pCGN552 1 and has been completely sequenced on both strands. 
The cDNA is contained as a 1481 bp insert in the vector pZLl (Bethesda 
Research Laboratories) and, beginning with the first ATG, contains an open 
reading frame encoding 446 amino acids. The reading frame contains the 
sequence deduced from the PCR fragment. The sequence of the cDNA insert 

25 was found to contain regions of homology to A6-desaturases (see Figure 8). For 
example, three conserved "histidine boxes" (that have been observed in other 
membrane-bound desaturases (Okuley et al., (1994) The Plant Cell 6: 147-1 58)) 
were found to be present in the Mortierella sequence at amino acid positions 
171-175, 207-212, and 387-391 (see Figure 5A-5D). However, the typical 

30 "HXXHH" amino acid motif for the third histidine box for the Mortierella 
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desaturase was found to be QXXHH. The amino-terminus of the encoded 
protein, showed significant homology to cytochrome b5 proteins. Thus, the 
Mortierella cDNA clone appears to represent a fusion between a cytochrome b5 
and a fatty acid desaturase. Since cytochrome b5 is believed to function as the 
5 electron donor for membrane-bound desaturase enzymes, it is possible that the 
N-terminal cytochrome b5 domain of this desaturase protein is involved in its 
function. This may be advantageous when expressing the desaturase in 
heterologous systems for PUFA production. 

Example 2 

10 Isolation of A6 Desaturase Nucleotide Sequence from Mortierella alpina 
A nucleic acid sequence from a partial cDNA clone, Ma524, encoding a 
A6 fatty acid desaturase from Mortierella alpina was obtained by random 
sequencing of clones from the M. alpina cDNA library described in Example 1 . 
cDNA-containing plasmids were excised as follows: 

15 Five |il of phage were combined with 100 |iil of E. coli DHIOB(ZIP) 

grown in ECLB plus 10 |ag/ml kanamycin, 0.2% maltose, and 10 mM MgS04 
and incubated at 37 degrees for 15 minutes. 0.9 ml SOC was added and 100 ^il 
of the bacteria immediately plated on each of 10 ECLB + 50 jig Pen plates. No 
45 minute recovery time was needed. The plates were incubated overnight at 37 

20 degrees. Colonies were picked into ECLB + 50 \xg Pen media for overnight 
cultures to be used for making glycerol stocks and miniprep DNA. An aliquot 
of the culture used for the miniprep is stored as a glycerol stock. Plating on 
ECLB + 50 jig Pen/ml resulted in more colonies and a greater proportion of 
colonies containing inserts than plating on 100 ^ml Pen. 

25 Random colonies were picked and plasmid DNA purified using Qiagen 

miniprep kits. DNA sequence was obtained from the 5 f end of the cDNA insert 
and compared to the databases using the BLAST algorithm. Ma524 was 
identified as a putative A6 desaturase based on DNA sequence homology to 
previously identified A6 desaturases. A full-length cDNA clone was isolated 
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from the M. alpina library. The abundance of this clone appears to be slightly 
(2X) less than Ma29. Ma524 displays significant homology to a portion of a 
Caenorhabditis elegans cosmid, W06D2.4, a cytochrome b5/desaturase fusion 
protein from sunflower, and the two A6 desaturases in the public databanks 
5 those from Synechocystis and Spirulina. 

In addition, Ma524 shows significant homology to the borage A6- 
desaturase sequence (PCT publication WO 96/21022). Ma524 thus appears to 
encode a A6-desaturase that is related to the borage and algal A6-desaturases. It 
should be noted that, although the amino acid sequences of Ma524 and the 

10 borage A6 are similar, the base composition of the cDNAs is quite different: the 
borage cDNA has an overall base composition of 60 % A+T, with some regions 
exceeding 70 %, while Ma524 has an average of 44 % A+T base composition, 
with no regions exceeding 60 %. This may have implications for expressing the 
cDNAs in microorganisms or animals which favor different base compositions. 

15 It is known that poor expression of recombinant genes can occur when the host 
has a very different base composition from that of the introduced gene. 
Speculated mechanisms for such poor expression include decreased stability or 
translatability of the mRNA. 

Example 3 

20 Identificat ion of A6-desatu rases Homologous 

to the Mortierella alpina A6-desaturase 

Nucleic acid sequences that encode putative A6-desaturases were 

identified through a BLASTX search of the est databases through NCBI using 

the Ma524 amino acid sequence. Several sequences showed significant 

25 homology. In particular, the deduced amino acid sequence of two Arabidopsis 
thaliana sequences, (accession numbers F13728 and T42806) showed 
homology to two different regions of the deduced amino acid sequence of 
Ma524. The following PCR primers were designed: ATTS4723-FOR 
(complementary to F13728) 5'-CUACUACUACUAGGAGTCCTCTA 

30 CGGTGTTTTG, SEQ ID NO:22, and T42806-REV (complementary to 
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T42806) 5' CAUCAUCAUCAUATGATGCTCAAGCTGAAACTG, SEQ ID 
NO:23. Five ng of total RNA isolated from developing siliques of Arabidopsis 
thaliana was reverse transcribed using BRL Superscript RTase and the primer 
TSyn 5 , -CCAAGCTTCTGCAGGAGCTC^lTlTIT^rrrr^r^-3^ (SEQ ID 

5 NO:24). PCR was carried out in a 50 ul volume containing: template derived 
from 25 ng total RNA, 2 pM each primer, 200 \iM each deoxyribonucleotide 
triphosphate, 60 mM Tris-Cl, pH 8.5, 15 mM (NH^SOii, 2 mM MgCl 2 , 0.2 U 
Taq Polymerase. Cycle conditions were as follows: 94 degrees for 30 sec, 50 
degrees for 30 sec, 72 degrees for 30 sec PCR was continued for 35 cycles 

10 followed by an additional extension at 72 degrees for 7 minutes. PCR resulted 
in a fragment of -750 base pairs which was subsequently subcloned, named 12- 
5, and sequenced. Each end of this fragment corresponds to the Arabidopsis 
est from which the PCR primers were derived. This is the sequence named 12-5. 
The deduced amino acid sequence of 12-5 is compared to that of Ma524 and 

1 5 ests from human (W28140), mouse (W53753), and C elegans (R05219) in 
Figure 4. Based on homology, these sequences represent desaturase 
polypeptides. The full-length genes can be cloned using probes based on the est 
sequences. The genes can then be placed in expression vectors and expressed in 
host cells and their specific A6- or other desaturase activity can be determined 

20 as described below. 

Example 4 

Isolation of A-12 Desaturase Nucleotide Sequence from Mortierella alvina 
Based on the fatty acids it accumulates, Mortierella alpina has an co6 
type desaturase. The co6 desaturase is responsible for the production of linoleic 
25 acid (18:2) from oleic acid (18:1). Linoleic acid ( 1 8 :2) is a substrate for a A6 
desaturase. This experiment was designed to determine if Mortierella alpina 
has a A12-desaturase polypeptide, and if so, to identify the corresponding 
nucleotide sequence. A random colony from the M alpina sequencing grade 
library, Ma648, was sequenced and identified as a putative desaturase based on 
30 DNA sequence homology to previously identified desaturases, as described for 
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Ma524 (see Example 2). The deduced amino acid sequence from the 5' end of 
the Ma648 cDNA displays significant homology to soybean microsomal ©6 
(A12) desaturase (accession #L43921) as well as castor bean oleate 12- 
hydroxylase (accession #U22378). In addition, homology is observed to a 
5 variety of other a>6 (A12) and a>3 (Al 5) fatty acid desaturase sequences. 

Example 5 

Isolation of Cytoch rome b5 Reductase Nucleotide Seq uence 
from Mortierella alpina 

A nucleic acid sequence encoding a cytochrome b5 reductase from 

1 0 Mortierella alpina was obtained as follows. A cDNA library was constructed 

based on total RNA isolated from Mortierella alpina as described in Example 1 . 

DNA sequence was obtained from the 5' and 3' ends of one of the clones, M12- 

27. A search of public databanks with the deduced amino acid sequence of the 

3' end of Ml 2-27 (see Figure 5) revealed significant homology to known 

1 5 cytochrome b5 reductase sequences. Specifically, over a 49 amino acid region, 

the Mortierella clone shares 55% identity (73% homology) with a cytochrome 

b5 reductase from pig (see Figure 4). 



Example 6 

Expression of M. alpina Desaturase Clones in Baker's Yeast 
Yeast Transformation 

Lithium acetate transformation of yeast was performed according to 
standard protocols (Methods in Enzymology, Vol. 194, p. 186-187, 1991). 
Briefly, yeast were grown in YPD at 30°C. Cells were spun down, resuspended 
in TE, spun down again, resuspended in TE containing 100 mM lithium acetate, 
spun down again, and resuspended in TE/lithium acetate. The resuspended 
yeast were incubated at 30°C for 60 minutes with shaking. Carrier DNA was 
added, and the yeast were aliquoted into tubes. Transforming DNA was added, 
and the tubes were incubated for 30 min. at 30°C. PEG solution (35% (w/v) 
PEG 4000, 100 mM lithium acetate, TE pH7.5) was added followed by a 50 
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min. incubation at 30°C. A 5 min. heat shock at 42°C was performed, the cells 
were pelleted, washed with TE, pelleted again and resuspended in TE. The 
resuspended cells were then plated on selective media. 

Desaturase Expression in Transformed Yeast 

5 cDNA clones from Mortierella alpina were screened for desaturase 

activity in baker's yeast. A canola A15-desaturase (obtained by PCR using 1 st 
strand cDNA from Brassica napus cultivar 212/86 seeds using primers based on 
the published sequence (Arondel et al Science 258:1353-1355)) was used as a 
positive control. The A15-desaturase gene and the gene from cDNA clone 

1 0 Ma29 was put in the expression vector p YES2 (Invitrogen), resulting in 

plasmids pCGR-2 and pCGR-4, respectively. These plasmids were transfected 
into S. cerevisiae yeast strain 334 and expressed after induction with galactose 
and in the presence of substrates that allowed detection of specific desaturase 
activity. The control strain was S. cerevisiae strain 334 containing the unaltered 

1 5 p YES2 vector. The substrates used, the products produced and the indicated 
desaturase activity were: DGLA (conversion to ARA would indicate A5- 
desaturase activity), linoleic acid (conversion to GLA would indicate A6- 
desaturase activity; conversion to ALA would indicate A15-desaturase activity), 
oleic acid (an endogenous substrate made by *S. cerevisiae, conversion to 

20 linoleic acid would indicate A12-desaturase activity, which S. cerevisiae lacks), 
or ARA (conversion to EPA would indicate Al 7-desaturase activity). The 
results are provided in Table 1 below. The lipid fractions were extracted as 
follows: Cultures were grown for 48-52 hours at 15°C. Cells were pelleted by 
centrifugation, washed once with sterile ddH 2 0, and repelleted. Pellets were 

25 vortexed with methanol; chloroform was added along with tritridecanoin (as an 
internal standard). The mixtures were incubated for at least one hour at room 
temperature or at 4°C overnight. The chloroform layer was extracted and 
filtered through a Whatman filter with one gram of anhydrous sodium sulfate to 
remove particulates and residual water. The organic solvents were evaporated 

30 at 40°C under a stream of nitrogen. The extracted lipids were then derivatized 
to fatty acid methyl esters (FAME) for gas chromatography analysis (GC) by 
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adding 2 ml of 0.5 N potassium hydroxide in methanol to a closed tube. The 
samples were heated to 95°C to 100°C for 30 minutes and cooled to room 
temperature. Approximately 2 ml of 14 % boron trifluoride in methanol was 
added and the heating repeated. After the extracted lipid mixture cooled, 2 ml 
5 of water and 1 ml of hexane were added to extract the FAME for analysis by 
GC. The percent conversion was calculated by dividing the product produced 
by the sum of (the product produced and the substrate added) and then 
multiplying by 100. To calculate the oleic acid percent conversion, as no 
substrate was added, the total linoleic acid produced was divided by the sum of 
10 (oleic acid and linoleic acid produced), then multiplying by 100. 
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Table 1 



M. alpina Desaturase Expression in Baker's Yeast 



CLONE 



TYPE OF ENZYME 
ACTIVITY 



% CONVERSION 
OF SUBSTRATE 



pCGR-2 
(canola A15 
desaturase) 



A6 

A15 

A5 

A17 

A12 



0 (18:2 to 18:3<o6) 
163 (18:2 to 18:3a>3) 
2.0 (20:3to20:4a>6) 
2.8 (20:4to20:5a>3) 
1.8 (18:1 to 18:2a>6) 



pCGR-4 
(M. alpina 
A6-Iike, Ma29) 



A6 

A15 

A5 

A17 

A12 



0 
0 

15.3 

0.3 

3.3 



pCGR-7 
(M. alpina 
A12-like, Ma648 



A6 

A15 

A5 

A17 

A12 



0 

3.8 
2.2 
0 

63.4 



The A15-desaturase control clone exhibited 16.3% conversion of the 
5 substrate. The pCGR-4 clone expressing the Ma29 cDNA converted 15.3% of 
the 20:3 substrate to 20:4w6, indicating that the gene encodes a A5-desaturase. 
The background (non-specific conversion of substrate) was between 0-3% in 
these cases. The pCGR-5 clone expressing the Ma524 cDNA showed 6% 
conversion of the substrate to GL A, indicating that the gene encodes a A6- 
10 desaturase. The pCGR-7 clone expressing the Ma648 cDNA converted 63.4% 
conversion of the substrate to LA, indicating that the gene encodes a A12- 
desaturase. Substrate inhibition of activity was observed by using different 
concentrations of the substrate. When substrate was added to 100 jiM, the 
percent conversion to product dropped as compared to when substrate was 
1 5 added to 25 jiM (see below). These data show that desaturases with different 
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substrate specificities can be expressed in a heterologous system and used to 
produce PUFAs. 

Table 2 represents fatty acids of interest as a percent of the total lipid 
extracted from the yeast host S. cerevisiae 334 with the indicated plasmid. No 
5 glucose was present in the growth media. Affinity gas chromatography was 
used to separate the respective lipids. GC/MS was employed to verify the 
identity of the produces). The expected product for the 5. napus A15- 
desaturase, a-linolenic acid, was detected when its substrate, linoleic acid, was 
added exogenously to the induced yeast culture. This finding demonstrates that 

1 0 yeast expression of a desaturase gene can produce functional enzyme and 
detectable amounts of product under the current growth conditions. Both 
exogenously added substrates were taken up by yeast, although slightly less of 
the longer chain PUFA, dihomo-y-linolenic acid (20:3), was incorporated into 
yeast than linoleic acid (18:2) when either was added in free form to the induced 

1 5 yeast cultures, y-linolenic acid was detected when linoleic acid was present 
during induction and expression of S. cerevisiae 334 (pCGR-5). The presence 
of this PUFA demonstrates A6-desaturase activity from pCGR-5 (MA524). 
Linoleic acid, identified in the extracted lipids from expression of S. cerevisiae 
334 (pCGR-7), classifies the cDNA MA648 from M. alpina as the A12- " 

20 desaturase. 
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Example 7 

Expression of A5 Desaturase in Plants 
Expression in Leaves 

This experiment was designed to determine whether leaves expressing 
5 Ma29 (as determined by Northern) were able to convert exogenously applied 
DGLA (20:3) to ARA (20:4). 

The Ma29 desaturase cDNA was modified by PCR to introduce 
convenient restriction sites for cloning. The desaturase coding region has been 
inserted into a d35 cassette under the control of the double 35S promoter for 
1 0 expression in Brassica leaves (pCGN5525) following standard protocols (see 
USPN 5,424,200 and USPN 5,106,739). Transgenic Brassica plants containing 
pCGN5525 were generated following standard protocols (see USPN 5,188,958 
and USPN 5,463,174). 

In the first experiment, three plants were used: a control, LP004-1, and 
1 5 two transgenics,, 5525-23 and 5525-29. LP004 is a low-linolenic Brassica 
variety. Leaves of each were selected for one of three treatments: water, GLA 
or DGLA. GLA and DGLA were purchased as sodium salts from NuChek Prep 
and dissolved in water at 1 mg/ml. Aliquots were capped under N 2 and stored at 
-70 degrees C. Leaves were treated by applying a 50 fil drop to the upper 
20 surface and gently spreading with a gloved finger to cover the entire surface. 
Applications were made approximately 30 minutes before the end of the light 
cycle to minimize any photo-oxidation of the applied fatty acids. After 6 days 
of treatment one leaf from each treatment was harvested and cut in half through 
the mid rib. One half was washed with water to attempt to remove 
25 unincorporated fatty acid. Leaf samples were lyophilized overnight, and fatty 
acid composition determined by gas chromatography (GC). The results are 
shown in Table 3. 
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20 



Leaves treated with GLA contained from 1 .56 to 2.4 wt% GLA. The fatty acid 
analysis showed that the lipid composition of control and transgenic leaves was 
essentially the same. Leaves of control plants treated with DGLA contained 
1.2-1.9 w% DGLA and background amounts of ARA (.26-.27 wt%). 
Transgenic leaves contained only .2-.7 wt% DGLA, but levels of ARA were 
increased (.74-1.1 wt%) indicating that the DGLA was converted to ARA in 
these leaves. 

Expression in Seed 

The purpose of this experiment was to determine whether a construct 
with the seed specific napin promoter would enable expression in seed. 

The Ma29 cDNA was modified by PCR to introduce Xhol cloning sites 
upstream and downstream of the start and stop codons, respectively, using the 
following primers: 

Madxho-forward: 

S'-CUACUACUACUACTCGAGCAAGATGGGAACGGACCAAGG 
(SEQ ID NO:25) 

Madxho-reverse : 

5 ! -CAUCAUCAUCAUCTCGAGCTACTCTTCCTTGGGACGGAG 
(SEQ ID NO:26). 

The PCR product was subcloned into pAMPl (GIBCOBRL) using the 
CloneAmp system (GIBCOBRL) to create pCGN5522 and the A5 desaturase 
sequence was verified by sequencing of both strands. 

For seed-specific expression, the Ma29 coding region was cut out of 
pCGN5522 as an Xhol fragment and inserted into the Sail site of the napin 
expression cassette, pCGN3223, to create pCGN5528. The HindUl fragment of 
pCGN5528 containing the napin 5* regulatory region, the Ma29 coding region, 
and the napin 3' regulatory region was inserted into the Hindlll site of 
pCGN 1 557 to create pCGN553 1 . Two copies of the napin transcriptional unit 
were inserted in tandem. This tandem construct can permit higher expression of 
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the desaturases per genetic loci. pCGN5531 was introduced into Brassica 
napus cv.LP004 via Agrobacterium mediated transformation. 

The fatty acid composition of twenty-seed pools of mature T2 seeds was 
analyzed by GC. Table 4 shows the results obtained with independent 
5 transformed lines as compared to non-transformed LP004 seed. The transgenic 
seeds containing pCGN5531 contain two fatty acids that are not present in the 
control seeds, tentatively identified as taxoleic acid (5,9-18:2) and pinolenic 
acid (5,9,12-18:3), based on their elution relative to oleic and linoleic acid. 
These would be the expected products of A5 desaturation of oleic and linoleic 
1 0 acids. No other differences in fatty acid composition were observed in the 
transgenic seeds. 
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Northern analysis is performed on plants to identify those expressing 
Ma29. Developing embryos are isolated approximately 25 days post anthesis or 
when the napin promoter is induced, and floated in a solution containing GLA 
or DGLA as described in Example 7. Fatty acid analysis of the embryos is then 
5 performed by GC to determine the amount of conversion of DGLA to ARA, 

following the protocol adapted for leaves in Example 7. The amount of ARA 
incorporated into triglycerides by endogenous Brassica acyltransferases is then 
evaluated by GC analysis as in Example 7. 



Example 8 

10 Expression of M. alpina A6 Desaturase in Brassica napus 

The Ma524 cDNA was modified by PCR to introduce cloning sites 
using the following primers: 



Ma524PCR-l (SEQ ID NO:27) 

15 5-CUACUACUACUATCTAGACTCGAGACCATGGCTGCTGCT 
CCAGTGTG 



Ma524PCR-2 (SEQ ID NO:28) 

5 f -CAUCAUCAUCAUAGGCCTCGAGTTACTGCGCCTTACCCAT 

20 These primers allowed the amplification of the entire coding region and 

added Xbal and Xhol sites to the 5-end and Xhol and Stul sites to the 3' end. 
The PCR product was subcloned into pAMPl (GIBCOBRL) using the 
CloneAmp system (GIBCOBRL) to create pCGN5535 and the A6 desaturase 
sequence was verified by sequencing of both strands. 

25 For seed-specific expression, the Ma524 coding region was cut out of 

pCGN5535 as an Xhol fragment and inserted into the Sail site of the napin 
expression cassette, pCGN3223, to create pCGN5536. The Notl fragment of 
pCGN5536 containing the napin 5' regulatory region, the Ma524 coding region, 
and the napin 3 f regulatory region was inserted into the Notl site of pCGN1557 
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to create pCGN5538. pCGN5538 was introduced into Brassica napus 
cv.LP004 via Agrobacterium mediated transformation. 

Maturing T2 seeds were collected from 6 independent transformation 
events in the greenhouse. The fatty acid composition of single seeds was 
5 analyzed by GC. Table 5 shows the results of control LP004 seeds and six 5538 

lines. All of the 5538 lines except #8 produced seeds containing GLA. 
Presence of GLA segregated in these seeds as is expected for the T2 selfed seed 
population. In addition to GLA, the M alpina A6 desaturase is capable of 
producing 18:4 (stearidonic) and another fatty acid believed to be the 6,9-18:2. 

1 0 The above results show that desaturases with three different substrate 

specificities can be expressed in a heterologous system and used to produce 
poly-unsaturated long chain fatty acids. Exemplified were the production of 
ARA (20:4) from the precursor 20:3 (DGLA), the production of GLA (18:3) 
from 18:2 substrate, and the conversion of 18:1 substrate to 18:2, which is the 

1 5 precursor for GLA. 
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Example 9 

Expression of M. alpina A12 desaturase in Brass ica nanus 
The Ma648 cDNA was modified by PCR to introduce cloning sites 
using the following primers: 

5 Ma648PCR-for (SEQ ID NO:29) 

S'-CUACUACUACUAGGATCCATGGCACCTCCCAACACT 

Ma648PCR-rev (SEQ ID NO:30) 

S'-CAUCAUCAUCAUGGTACCTCGAGTTACTTCTTGAAAAAGAC 

These primers allowed the amplification of the entire coding region and 
10 added a BamHI site to the 5' end and Kpnl and Xhol sites to the 3* end. The 

PCR product was subcloned into pAMPl (GIBCOBRL) using the Clone Amp 
system (GIBCOBRL) to create pCGN5540 and the A12 desaturase sequence 
was verified by sequencing of both strands. 

For seed-specific expression, the Ma648 coding region was cut out of 
1 5 pCGN5540 as a BamHI/XhoI fragment and inserted between the Bglll and 

Xhol sites of the napin expression cassette, pCGN3223, to create pCGN5542. 
The Asp718 fragment of pCGN5541 containing the napin 5 f regulatory region, 
the Ma648 coding region, and the napin 3 f regulatory region was inserted into 
the Asp718 site of pCGN5138 to create pCGN5542. PCGN5542 was 
20 introduced into two varieties of Brassica napus via Agrobacterium mediated 

transformation. The commercial canola variety, SP30021, and a low-linolenic 
line, LP30108 were used. 

Mature selfed T2 seeds were collected from 19 independent LP30108 
transformation events and a non-transformed control grown in the greenhouse. 
25 These seeds are expected to be segregating for the A12 desaturase transgene. 

The fatty acid composition of 20-seed pools was analyzed by GC. The results 
are shown in Table 6. All transformed lines contained increased levels of 18:2, 
the product of the A 12 desaturase. Levels of 1 8:3 were not significantly 
increased in these plants. Events # 1 1 and 16 showed the greatest accumulation 
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of 1 8:2 in the pooled seeds. To investigate the segregation of 1 8:2 levels in the 
T2 seeds and to identify individual plants to be taken on to subsequent 
generations, half-seed analysis was done. Seeds were germinated overnight in 
the dark at 30 degrees on water-soaked filter paper. The outer cotyledon was 
5 excised for GC analysis and the rest of the seedling was planted in soil. Results 

of some of these analyses are shown in Table 7. Individual T2 seeds containing 
the M. alpina A 12 desaturase accumulated up to 60% 1 8:2 in the seeds. Sample 
97xxl 116 #59 is an example of a null segregant. Even in the highest 18:2 
accumulators, levels of 18:3 were increased only slightly. These and other 
10 individually selected T2 plants were grown in the greenhouse and in the field to 

produce T3 seed. 

Mature selfed T2 seeds were collected from 20 independent SP30021 
transformation events and a non-transformed control grown in the greenhouse. 
These seeds are expected to be segregating for the A 12 desaturase transgene. 

15 The fatty acid composition of 20-seed pools was analyzed by GC. The data are 

presented in Table 8. All transformed lines contained increased levels of 1 8:2, 
the product of the A12 desaturase. As in the low-linolenic LP30108 line, levels 
of 18:3 were not significantly increased. Events # 4 and 12 showed the greatest 
accumulation of 1 8:2 in the pooled seeds. To investigate the segregation of 

20 18:2 levels in the T2 seeds and to identify individual plants to be taken on to 

subsequent generations, alf-seed analysis was done. Seeds were germinated 
overnight in the dark at 30 degrees on water-soaked filter paper. The outer 
cotyledon was excised for GC analysis and the rest of the seedling was planted 
in soil. Results of some of these analyses are shown in Table 9. Samples 

25 97xxl 157 #88 and #18 are examples of null segregants for 5542-SP30021-4 and 

5542-SP30021-12 respectively. These and other individually selected T2 plants 
were grown in the greenhouse and in the field to produce T3 seed 
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Example 10 

Simultaneous expression of M alpina A6 and A 12 
desaturases in Brassica napus 

In order to express the M. alpina A6 and A 12 desaturases from the same 
T-DNA, the following construct for seed-specific expression was made. 

The NotI fragment of pCGN5536 containing the containing the napin 5' 
regulatory region, the Ma524 coding region, and the napin 3' regulatory region 
was inserted into the NotI site of pCGN5542 to create pCGN5544. The 
expression modules were oriented in such a way that the direction of 
transcription from Ma524 and Ma648 and the nptll marker is the same. 

PCGN5544 was introduced into Brassica napus cv.LP30108 via 
Agrobacterium mediated transformation. Mature selfed T2 seeds were collected 
from 16 independent LP30108 transformation events and a non-transformed 
control that were grown in the greenhouse. These seeds are expected to be 
segregating for the A6+ A12 desaturase transgene. The fatty acid composition 
of 20-seed pools was analyzed by GC. The results are presented in Table 10. 
All but one of the lines (5544-LP30 108-3) shows an altered oil composition as 
compared to the controls. GLA was produced in all but three of the lines (-3, -4, 
-1 1); two of the three without GLA ( -4, -11) showed increased 18:2 indicative 
of expression of the A12 desaturase. As a group, the levels of GLA observed in 
plants containing the double A6 + A 12 construct (pCGN5544) were higher than 
those of plants containing pCGN5538 (A6 alone). In addition, levels of the A 6,9 
18:2 are much reduced in the plants containing the A12 + A6 as compared to A6 
alone. Thus, the combination of A6 and A 12 desaturases on one T-DNA leads 
to the accumulation of more GLA and fewer side products than expression of 
A6 desaturase alone. To investigate the segregation of GLA levels in the T2 
seeds and to identify individual plants to be taken on to subsequent generations, 
half-seed analysis was done. Seeds were germinated overnight in the dark at 30 
degrees on water-soaked filter paper. The outer cotyledon was excised for GC 
analysis and the rest of the seedling was planted in soil. Results of some of 
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these analyses are shown in Table 1 1. As expected for the T2 population, levels 
of GLA and 18:2 are segregating in the individual seeds. GLA content of up to 
60% of total fatty acids was observed in individual seeds. Individual events 
were selected to be grown in the greenhouse and field for production of T3 
5 seed. 

Transgenic plants including Brassica, soybean, safflower, corn flax and 
sunflower expressing the constructs of this invention can be a good source of 
GLA. 

Typical sources of GLA such as borage produce at most 25% GLA. In 
10 contrast the plants in Table 10 contain up to 30% GLA. Furthermore, the 

individual seeds shown in Table 1 1 contain up to 60% GLA. 
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Example 11 

Simultaneous expression of ML alpina AS and A6 
desaturases in Brass ica napus 

5 In order to produce arachadonic acid (ARA) in transgenic canola oil 

both A5 and A6 desaturase activities need to be introduced. In order to facilitate 

downstream characterization and breeding, it may be advantageous to have both 

activities encoded by a single T-DNA. The following example illustrates the 

simultaneous expression of A5 and A6 desaturases. 

1 0 The Asp7 1 8 fragment of pCGN5528 containing the napin 5' regulatory 

region, the Ma29 coding region, and the napin 3' regulatory region was inserted 
into the Asp718 site of pCGN5138 to create pCGN5545. The NotI fragment of 
pCGN5536 containing the napin 5' regulatory region, the Ma524 coding region, 
and the napin 3' regulatory region was inserted into the NotI site of pCGN5545 

15 to create pCGN5546. The expression modules were oriented in such a way that 

the direction of transcription from Ma524 and Ma29 and the nptll marker is the 
same. 

PCGN5546 was introduced into Brassica napus cv.LP30108 via 
Agrobacterium mediated transformation. Mature selfed T2 seeds were collected 
20 from 30 independent LP301 08 transformation events that were grown in the 

greenhouse. The fatty acid composition of 20-seed pools was analyzed by GC. 
The results are shown in Table 12. All the lines show expression of both 
desaturases as evidenced by the presence of A 5,9 18:2 (as seen in pCGN553 1 
plants) and A 6 * 9 18:2 and GLA (as seen in pCGN5538 plants) 
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Example 12 

Simultaneous expression of M alpina A5, A6 and A12 
desaturases in Brassica nanus 

5 In order to achieve optimal production of ARA in transgenic canola oil 

both the A6 and A12 desaturase activities may need to be present in addition to 

the A5 activity. In order to facilitate downstream characterization and breeding, 

it may be advantageous to have all of these activities encoded by a single T- 

DNA. The following example illustrates the simultaneous expression of A5, A6 

1 0 and A 1 2 desaturases. 

The Hindlll fragment of pCGN5528 containing the napin 5' regulatory 
region, the Ma29 coding region, and the napin 3 f regulatory region was inserted 
into the Hindlll site of pCGN5544 to create pCGN5547. The expression 
modules were oriented in such a way that the direction of transcription from 
1 5 Ma29, Ma524, Ma648 and the nptll marker is the same. 

PCGN5547 was introduced into Brassica napus cv.LP30108 via 
Agrobacterium mediated transformation. Mature selfed T2 seeds were collected 
from 30 independent LP301 08 transformation events that were grown in the 
greenhouse. The fatty acid composition of 20-seed pools was analyzed by GC. 

20 The results are shown in Table 13. Twenty-seven of the lines show significant 

accumulation of GLA and in general the levels of GLA observed are higher 
than those seen in the 5546 plants that did not contain the A12 desaturase. The 
A 12 desaturase appears to be active in most lines as evidenced by the lack of 
detectable A6,9 18:2 and elevated 18:2 levels in most plants. Small amounts of 

25 A5,9 18:2 are seen in the 5547 plants, although the levels are generally less than 

those observed in the 5546 plants. This may be due to the presence of the A 12 
desaturase which efficiently converts the 18:1 to 18:2 before it can be 
desaturated at the A5 position. 
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Example 13 

Stereospecific Distribution of A6-Desaturated Oils 

This experiment was designed to investigate the stereospecific 
distribution of the A6-desaturated oils in seeds expressing pCGN5538 (Ma 524 
5 cDNA). Three seed samples were used: 

1) Non-transformed B. napus cv. LP004 seeds (control) 

2) Segregating T2 seeds of pCGN553 8-LP004- 1 9 

3) Segregating T2 seeds of pCGN5538 LP004-29 
The following protocol was used for the analysis: 

10 1 . Seed Oil Extraction 

Fifty seeds were placed in a 12 x 32 mm vial and crushed with a glass 
rod. 1.25 mL hexane was added and the mixture was vortexed. The seeds were 
extracted overnight on a shaker. The extract was then filtered through a 0.2 
micron filter attached to a 1 cc syringe. The extract was then dried down under 
1 5 nitrogen. The resulting oil was used for digestion and derivatization of the 

whole oil sample. 

2. Digestion 

A. Liquid Oil Digestion 

The stock lipase (from Rhizopus arrhizus, Sigma, L4384) was diluted to 
20 approximately 600,000 units/mL with a goal of obtaining 50% digestion of the 

TAG. The stock lipase is maintained at 4 degrees C and placed on ice. The 
amount of reagents may be adjusted according to the amount of oil to be 
digested. 

The following amounts are based on a 2.0 mg extracted oil sample. In a 
25 12 x 32 mm screw cap vial the following were added: 2.0 mg oil, 200 |iL 0.1 M 

tris HC1 pH 7, 40 \iL 2.2 w/v% CaCl 2 2H 2 0, and 100 *iL 0.05 w/v % bile salts. 
The material was vortexed and sonicated to disperse the oil. Twenty |iL of 
diluted lipase was added and the mixture was vortexed continuously for 1.0 
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minute at room temperature. A white precipitate formed. The reaction was 
stopped with 100 uL 6M HC1 and vortexing. Five hundred uL CHC1 3 :CH 3 0H 
(2:1) was added and the mixture was vortexed and held on ice while reaining 
digestions were carried out. Samples were vortexed again and centrifuged 
5 briefly to sharpen layers. The lower layer containing digest products was 

removed with a pasteur pipette and placed in a 12 x 32 mm crimp cap vial. The 
material was then re-extracted with 300 \xL CHC1 3 , vortexed, centrifuged, and 
combined with the lower layers. The digest products were kept on ice as much 
as possible. HPLC separation is performed as soon as possible after digestion to 
1 0 minimize acyl migration. 

B. Solid Fat Digestion 

The procedure for liquid oil digestion described above was followed 
except that 20 \xl 1 1 :0 methyl ester is added to 2.0 mg solid fat. 

3. HPLC Separation 

1 5 The digestion products were dried down in chloroform to approximately 

200 \xL. Each sample was then transferred into an insert in an 8 x 40 mm shell 
vial and 30 |iL was injected for HPLC analysis. 

The high performance liquid chromatographic system was equipped 
with a Varex ELSD IIA evaporative light scattering detector with tube 
20 temperature at 105°C and nitrogen gas flow at 40 mL/min; a Waters 712 Wisp 

autosampler, three Beckman 1 14M Solvent Delivery Modules; a Beckman 
421 A controller, a Rheodyne pneumatically actuated stream splitter; and a 
Gilson micro fractionator. The chromatography column is a 220 x 4.6 mm, 5 
micron normal phase silica cartridge by Brownlee. 

25 The three solvents used were: 

A= hexanertoluene 1 : 1 

B= toluene: ethyl acetate 3 : 1 

C= 5% formic acid in ethyl acetate 
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The gradient profile was as follows: 



Time (min) 


Function 


Value 


Duration 


Oflow 


2.0 mL/min 






0%B 


10 






0%C 


2 






2%C 


25 




6 min 


14.0 % C 


2 




1 min 


15.0 


End program 







A chromatographic standard mixture is prepared in hexanertoluene 1:1 
containing the following: 

0.2 mg/mL triglyceride 16:0 



5 2.0 mg/mL 16:0 Free Fatty Acid 

0.2 mg/mL dil6:0 mixed isomers (1,2-diacylglycerol and 1,3-diacyl glycerol) 
0.2 mg/mL 3-mono acylglycerol 16:0 
0.2 mg/mL 2-mono acylglycerol 16:0 

For each sample, the fraction containing the 2-mag peak is collected 
10 automatically by method controlled timed events relays. A time delay is used to 

synchronize the detector with the collector's emitter. The 2-mag peaks are 
collected and the fractions are evaporated at room temperature overnight. 

The sn-2 composition results rely on minimization of acyl migration. 
Appearance of 1 -monoacylglycerol and/or 3-monoacylglycerol peaks in the 
15 chromatograph means that acyl migration has occurred. 

4. Derivatization 

To derivatize the whole oil, 1.0 mg of the extracted whole oil was 
weighed into a 12 x 32 mm crimp cap vial. One mL toluene was then added. 
The sample is then vortexed and a 50 |iL aliquot was removed for 
20 derivatization. To the dried down 2-mag samples, 50 \iL toluene was added. To 

both the whole oil and 2-mag fractions 105 uL H2SO4/CH3OH @ 8.76 wt% is 
added. The cap was tightly capped and the sample is refluxed for 1 hour at 95 
degrees C. The sample was allowed to cool and 500 uL 10 w/v % NaCl in 
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water and 60 uL heptane was added. The organic layer was removed and 
inserted in a 12 x 32 mm crimp cap vial. 

5. GLC Analysis 

A Hewlett Packard model 6890 GC equipped with a split/splitless 
5 capillary inlet, FID detector, 6890 series autosampler and 3392A Alpha Omega 

integrator is set up for the capillary column as follows: 



A. Supelco Omegawax 250, 30 m length, 0.25 mm id, 0.25 urn film 
thickness 



injection port: 


260 C 


detector: 


270 C 


initial temp: 


170 C 


initial time: 


1.5 min 


rate: 


30 deg/min 


final temp: 


245 C 


final time: 


6.5 min 


injection vol: 


1.5 uL 


head pressure: 


25 psi 


split ratio: 


30 


carrier gas: 


He 


make-up gas: 


N 2 


FID gas: 


H + air 



Percent compositions of fatty acid methyl esters are calculated as mole 
percents. For carbon chain lengths less than 12, the use of theoretical or 
25 empirical response factors in the area percent calculation is desirable. 
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6. Calculations 

The mean distribution of each acyl group at each sn-l and sn-3 position 
was calculated. 

mean sn-l and sn-3 composition = (3 WO comp - MAG comp) / 2 
5 WO = whole oil 

MAG= monoacylglycerol 

The results of this analysis are presented in Table 14. The GLA and A 6,9 
18:2 are evenly distributed between the sn-2 and sn-l, 3 positions. This 
analysis can not discriminate between fatty acids in the sn-l vs. sn-3 positions. 
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Example 14 

Fatty Acid Compositions of Transgenic Plants 

A5 and A6 transgenic plants were analyzed for their fatty acid content. 

The following protocol was used for oil extraction: 

5 1 . About 400 mg of seed were weighed out in duplicate for each 

sample. 

2. The seeds were crushed in a motar and pestle. The mortar and 
pestle was rinsed twice with 3ml (2:1) (v:v) 
CHCl 3 :CH 3 OH/MeOH. An additional 6 ml (2: 1) was added to 

10 the 20ml glass vial (oil extracted in 12ml total 2:1). 

3. Samples were vortexed and placed on an orbital shaker for 2 
hours with occasional vortexing. 

4. SmloflMNaCl was added to each sample. Sample was 
vortexed then spun in centrifuge at 2000rpm for 5 minutes. 

1 5 Lower phase was drawn off using a pasteur pipette. 

5. Upper phase was re-extracted with an additional 5ml. Sample 
was vortexed then spun in centrifuge at 2000 rpm for 5 minutes. 
The lower phase was drawn off using a pasteur pipette and added 
to previous lower phase. 

20 6 - CHCl3:CH 3 OH /MeOH was evaporated under nitrogen using 

evaporative cooling. Vial containing extracted oil was sealed 
under nitrogen. Between 120mg- 160mg oil was extracted for 
each sample. 

For GC-MS analysis, fatty acid methyl esters were dissolved in an 
25 appropriate volume of hexane and analyzed using a Hewlett-Packard 5890 

Series II Plus gas chromatograph (Hewlett Packard, Palo Alto, C A) equipped 
with a 30 m x 0.32 mm i.d. Omegawax 320 fused sillica capillary column 
(Supelco, Bellefonte, PA) and a Hewlett-Packard 5972 Series mass selective 
detector. Mass spectra were intrepreted by comparison to the mass spectra in 



-91- 



WO 98/46764 




PCT/US98/07421 



NIST/EPA/NIH Chemical Structure Database using a MS Chem Station 
(#G1036A) (Hewlett Packard). 

Transgenic line 5531-6 was analyzed in duplicate (A, B) and compared 
to control line LP004-6. The fatty acid profile results are shown in Table 15. 

5 Transgenic line 5538-19 was analyzed in duplicate (A, B) and compared 

to control line LP004-6. The fatty acid profile results are shown in Table 16. 
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Table 15 
Fatty Acid Profile 
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TRANSGENIC 
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not mm? h 


AA] mini A 


OOlfUlOl.d 


O01f0104.d 


C12:0 










C13:0 










C14:0 




A A<*2 




0.061 


C14:l 










C15*0 komcr 










C15:0 










C16:0 


4 107 


' /I A1/1 

4.UJ4 


4.257 


4.224 


C16:l 


\J. 1 O 1 


a i tft 

U. 1 /J 


A *>AA 

0.200 


0.199 


C16:2 


0 061 


A A£^ 


A AO 1 

0.081 


0.060 


C17:0 










C16:3 


0 244 




A 1 CC 

0.155 


0.151 


C16:4 










C18:0 


2 608 


7 714 
JL. 1 14 


J.Joo 


3.417 


C18:lw9 


65 489 


r»A 4^4 

00.434 


59.529 


59.073 


C18:lw7 


2 297 


9 18^ 


2.356 


2.393 


C18:2 5,9 






0.144 


6.269 


C18:2w6 


19 898 
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19.059 
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0.496 


C18:3w6 
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1 418 
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C18:4w3 










C20:0 


0.962 


0.998 


1.009 


1.022 


C20:lwll 


1.336 


1.335 


1.058 


1.065 


C20:lw9 










C20:lw7 






0.076 


0.080 


C20:2w6 


0.073 


0.073 




0.052 


C20:3w6 
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Table 15 
Fattv Acid Profile 





CONTROL 


CONTROL 


TRANSGENIC 


TRANSGENIC 














LP004-6A 


LP004-6B 


5531-6A 


5531-6B 














LRL-2043 


LRL-2044 


LRL-2042 


LRL-2045 




001f0102.d 


001f0103.d 


OOlfOlOl.d 


001f0104.d 


C20:4w6 










C20:3w3 










C20:4w3 










C20:5w3 










C22:0(L000) 


0.542 


0.558 


0.463 


0.467 


C22:lwll 




0.038 






C22:lw9 










C22:lw7 




0.034 






C21:5 










C23:0 




0.029 
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C22:5w3 










C24:0 


0.373 


0.391 


0.280 


0.283 | 


C22:6w3 


0.314 


0.317 


0.223 


0.212 


C24:lw9 




















TOTAL 


100.00 


100.00 


100.00 


I 100.00 
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Table 16 
Fatty Acid Profile 





5538-19A 


5538-19B 


LP004-6A 


LP004-6B 
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TRANSGENIC 


CONTROL 
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C18:0 


3.815 
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C18:l 
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68.506 


68.352 


CI 8:2 (6,9) 
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Table 16 
Fattv Acid Profile 





5538-19A 


5538-1 9B 


LP004-6A 


LP004-6B 




TRANSGENIC 


TRANSGENIC 


CONTROL 


CONTROL 














LRL-2166 


LRL-2167 


LRL-2168 


LRL-2169 












C20:4w3 










C20:5w3 










C22:0 


0.506 


0.484 


0.535 


0.539 


C22:l 


0.017 


0.020 


0.032 


0.032 


C21:5 




0.040 


0.030 


0.031 


C22:4w6 


0.038 


0.064 


0.015 


0.014 


C22:5w6 










C22:5w3 


0.023 


0.018 


0.021 


0.017 


C24:0 


0.352 


0.321 


0.353 


0.362 


C22:6w3 


0.009 








C24:lw9 


0.129 


0.121 


0.260 


0.255 












TOTAL 


100.00 


100.00 


100.00 


100.00 
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Example 15 

Combined Expression of A6 and A12 
Desaturases in B. napus Achieved by Crossing 

Plants containing either the A6 or the A 12 desaturase were crossed and 

5 individual Fl half-seeds were analyzed for fatty acid composition by GC. Data 

from one such cross are given in Table 17. The parents for the cross were 

5538-LP004-25-2-25 (A6 expressor) and 5542-SP30021-10-16 (A12 expressor). 

Reciprocal crosses were made and the results of 25 individual Fl seeds of each 

are shown in the table. Crosses are described such that the first parent indicated 

10 is the female. Both sets of crosses gave approximately the same results. 

Compared to the parents, the A 6 * 9 18:2 decreased, and the GLA increased. A 9,12 
18:2 levels are increased in most of the FTs as well. Note that these are Fl 
seeds and only contain one set of each desaturase. In future generations and 
selection of events homozygous for each desaturase, the F2 GLA levels 

1 5 obtained may be even higher. 

Combining traits by crossing may be preferable to combining traits on 
. one T-DNA in some situations. Particularly if both genes are driven off of the 
same promoter (in this case napin), issues of promoter silencing may favor this 
approach over putting nultiple cDNAs on one construct. 

20 Alternatively, in some cases, combining multiple cDNAs on one T-DNA 

may be the method of choice. The results are shown in Table 1 7. 
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Example 16 

Expression of M. alnina desatu rases in soybean 
The M. alpina desaturases can be used to drive production of GLA and 
other PUFAs in soybean by use of the following expression constructs. Two 
means by which exogenous DNA can be inserted into the soybean genome are 
Agrobacterium infection or particle gun. Particle gun transformation is 
disclosed in U.S. patent 5,503,998. Plants can be selected using a glyphosate 
resistance marker (4, 971, 908). Agrobacterium transformation of soybean is 
well established to one of ordinary skill in the art. 

For seed specific expression, the coding regions of the desaturase 
cDNAs are placed under control of the 5* regulatory region of Glycine max 
alpha-type beta conglycinin storage protein gene. The specific region that can 
be used is nucleotides 78-921 of gi 169928 (Doyle, J.J., Schuler, MA., ' 
Godette, W.D., Zenger, V., Beachy, R.N., and Slightom. J.L., 1986 J. Biol. 
Chem. 261 (20), 9228-9238). The 3* regulatory region that can be used is from 
the pea ribulose 1,5 bisphosphate carboxylase/oxygenase small subunit (rbcS) 
gene. The specific sequences to be used are nucleotides 1-645 of gi 169145 
(Hunt, A.G. 1988 DNA 7: 329-336). 

Since soybean seeds contain more 18:2, and perhaps more endogenous 
A 12 desaturase activity than Brassica, the effect of the Mortierella A 12 
desaturase on achieving optimal GLA levels can be tested as follows. A 
construct containing the A6 cDNA can be used to see if A 6 ' 9 18:2 is produced 
along with GLA. A construct containing the A 12 desaturase can be used to see 
if the amount of 1 8:2 can be increased in soybean. A construct containing both 
the A6 and A 12 desaturases can be used to produce optimal levels of GLA. 
Alternatively, plants containing each of the single desaturases may be crossed if 
necessary to combine the genes. 

Similar constructs may be made to express the A5 desaturase alone, or in 
combination with A 12 and/or A6 desaturases. 
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Example 17 

Human Desaturase Gene Sequences 
Human desaturase gene sequences potentially involved in long chain 
polyunsaturated fatty acid biosynthesis were isolated based on homology 
5 between the human cDNA sequences and Mortierella alpina desaturase gene 

sequences. The three conserved "histidine boxes" known to be conserved 
among membrane-bound desaturases were found. As with some other 
membrane-bound desaturases the final HXXHH histidine box motif was found 
to be QXXHH. The amino acid sequence of the putative human desaturases 
10 exhibited homology to M alpina A5, A6, A9, and A12 desaturases. 

The M. alpina A5 desaturase and A6 desaturase cDNA sequences were 
used to search the LifeSeq database of Incyte Pharmaceuticals, Inc., Palo Alto, 
California 94304. The A5 desaturase sequence was divided into fragments; 1) 
amino acid no. 1-150, 2) amino acid no. 151-300, and 3) amino acid no. 301- 

15 446. The A6 desaturase sequence was divided into three fragments; 1) amino 

acid no. 1-150, 2) amino acid no. 151-300, and 3) amino acid no. 301-457. 
These polypeptide fragments were searched against the database using the 
"tblastn" algorithm. This alogarithm compares a protein query sequence against 
a nucleotide sequence database dynamically translated in all six reading frames 

20 (both strands). 

The polypeptide fragments 2 and 3 of M. alpina A5 and A6 have 
homologies with the ClonelD sequences as outlined in Table 18. The ClonelD 
represents an individual sequence from the Incyte LifeSeq database. After the 
"tblastn" results have been reviewed, Clone Information was searched with the 

25 default settings of Stringency of >=50, and Productscore <=1 00 for different 

ClonelD numbers. The Clone Information Results displayed the information 
including the ClusterlD, ClonelD, Library, HitID, Hit Description. When 
selected, the ClusterlD number displayed the clone information of all the clones 
that belong in that ClusterlD. The Assemble command assembles all of the 

30 ClonelD which comprise the ClusterlD. The following default settings were 
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used for GCG (Genetics Computer Group, University of Wisconsin 
Biotechnology Center, Madison, Wisconsin 53705) Assembly: 

Word Size: 7 

5 Minimum Overlap: 14 

Stringency: 0.8 

Minimum Identity: 14 

Maximum Gap: 10 

Gap Weight: 8 

10 Length Weight: 2 

GCG Assembly Results displayed the contigs generated on the basis of 
sequence information within the ClonelD. A contig is an alignment of DNA 
sequences based on areas of homology among these sequences. A new 

1 5 sequence (consensus sequence) was generated based on the aligned DNA 

sequences within a contig. The contig containing the ClonelD was identified, 
and the ambiguous sites of the consensus sequence was edited based on the 
alignment of the ClonelDs (see SEQ ID NO:31 - SEQ ID NO:35) to generate 
the best possible sequence. The procedure was repeated for all six ClonelD 

20 listed in Table 18. This produced five unique contigs. The edited consensus 

sequences of the 5 contigs were imported into the Sequencher software program 
(Gene Codes Corporation, Ann Arbor, Michigan 48 105). These consensus 
sequences were assembled. The contig 251 1785 overlaps with contig 3506132, 
and this new contig was called 2535 (SEQ ID NO:37). The contigs from the 

25 Sequencher program were copied into the Sequence Analysis software package 

of GCG. 

Each contig was translated in all six reading frames into protein 
sequences. The M. alpina A5 (MA29) and A6 (MA524) sequences were 
compared with each of the translated contigs using the FastA search (a Pearson 
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and Lipman search for similarity between a query sequence and a group of 
sequences of the same type (nucleic acid or protein)). Homology among these 
sequences suggest the open reading frames of each contig. The homology 
among the M alpina A5 and A6 to contigs 2535 and 3854933 were utilized to 
5 create the final contig called 253538a. Figure 9 is the FastA match of the final 

contig 253538a and MA29, and Figure 10 is the FastA match of the final contig 
253538a and MA524. The DNA sequences for the various contigs are 
presented in SEQ ID NO:31 -SEQ ID NO:37 The various peptide sequences 
are shown in SEQ ID NO:38 - SEQ ID NO: 44. 

1 0 Although the open reading frame was generated by merging the two 

contigs, the contig 2535 shows that there is a unique sequence in the beginning 
of this contig which does not match with the contig 3854933. Therefore, it is 
possible that these contigs were generated from independent desaturase like 
human genes. 

1 5 The contig 253538a contains an open reading frame encoding 432 

amino acids. It starts with Gin (CAG) and ends with the stop codon (TGA). 
The contig 253538a aligns with both Af. alpina A5 and A6 sequences, 
suggesting that it could be either of the desaturases, as well as other known 
- desaturases which share homology with each other. The individual contigs 

20 listed in Table 18, as well as the intermediate contig 2535 and the final contig 

253538a can be utilized to isolate the complete genes for human desaturases. 

Uses of the Human Desaturases 

These human sequences can be expressed in yeast and plants utilizing 
the procedures described in the preceding examples. For expression in 
25 mammalian cells and transgenic animals, these genes may provide superior 

codon bias. In addition, these sequences can be used to isolate related 
desaturase genes from other organisms. 
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Table 18 



Sections of the 


Clone ID from LifeSeq Database 


Keyword i 


Desatu rases 


151-300 A5 


3808675 


fatty acid desaturase 


301-446 A5 


354535 


A6 


151-300 A6 


3448789 


A6 


151-300 A6 


1362863 


A6 


151-300 A6 


2394760 


A6 


301-457 A6 


3350263 


A6 



Example 18 

Identification of Homojogues to M. alpina AS and A6 desaturases 

A nucleic acid sequence that encodes a putative A5 desaturase was 
identified through a TBLASTN search of the expressed sequence tag databases 
through NCBI using amino acids 100-446 of Ma29 as a query. The truncated 
portion of the Ma29 sequence was used to avoid picking up homologies based 
on the cytochrome b5 portion at the N-terminus of the desaturase. The deduced 
amino acid sequence of an est from Dictyostelium discoideum (accession # 
C25549) shows very significant homology to Ma29 and lesser, but still 
significant homology to Ma524. The DNA sequence is presented as SEQ ID 
NO:45. The amino acid sequence is presented as SEQ ID NO:46. 

Example 19 

Identifica tion of M. alpina AS and A6 homologues in other 
PUFA-producing organisms 

To look for desaturases involved in PUFA production, a cDNA library 

was constructed from total RNA isolated from Phaeodactylum thcornutum. A 

plasmid-based cDNA library was constructed in pSPORTl (GIBCO-BRL) 

following manufacturer's instructions using a commercially available kit 

(GIBCO-BRL). Random cDNA clones were sequenced and nucleic acid 

sequences that encode putative A5 or A6 desaturases were identified through 

BLAST search of the databases and comparison to Ma29 and Ma524 sequences. 



-105- 



WO 98/46764 




PCT/US98/07421 



One clone was identified from the Phaeodactylum library with 
homology to Ma29 and Ma524; it is called 144-01 1-B12. The DNA sequence is 
presented as SEQ ID NO:47. The amino acid sequence is presented as SEQ ID 
NO:48. 

5 Example 20 

Identification of M. alvina AS and A6 homologues in other 
PUFA-producing organisms 

To look for desaturases involved in PUFA production, a cDNA library 

was constructed from total RNA isolated from Schizochytrium species. A 

10 plasmid-based cDNA library was constructed in pSPORTl (GE3CO-BRL) 

following manufacturer's instructions using a commercially available kit 

(GIBCO-BRL). Random cDNA clones were sequenced and nucleic acid 

sequences that encode putative A5 or A6 desaturases were identified through 

BLAST search of the databases and comparison to Ma29 and Ma524 sequences. 

15 One clone was identified from the Schizochytrium library with 

homology to Ma29 and Ma524; it is called 8 1-23-C7. This clone contains a -1 
kb insert. Partial sequence was obtained from each end of the clone using the 
universal forward and reverse sequencing primers. The DNA sequence from 
the forward primer is presented as SEQ ID NO:49. The peptide sequence is 

20 presented as SEQ ID NO:50. The DNA sequence from the reverse primer is 

presented as SEQ ID NO:51. The amino acid sequence from the reverse primer 
is presented as SEQ ID NO:52. 

Example 21 

Nutritional Compositions 

25 The PUFAs of the previous examples can be utilized in various 

nutritional supplements, infant formulations, nutritional substitutes and other 
nutrition solutions. 

I. INFANT FORMULATIONS 

A. Isomil® Soy Formula with Iron. 
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Usage: As a beverage for infants, children and adults with an allergy or 
sensitivity to cow's milk. A feeding for patients with disorders for which 
lactose should be avoided: lactase deficiency, lactose intolerance and 
galactosemia. 

5 Features: 

• Soy protein isolate to avoid symptoms of cow's-milk-protein 
allergy or sensitivity 

• Lactose-free formulation to avoid lactose-associated diarrhea 

• Low osmolaity (240 mOsm/kg water) to reduce risk of osmotic 
10 diarrhea. 

• Dual carbohydrates (corn syrup and sucrose) designed to 
enhance carbohydrate absorption and reduce the risk of exceeding the 
absorptive capacity of the damaged gut. 

• 1.8 mg of Iron (as ferrous sulfate) per 100 Calories to help 
1 5 prevent iron deficiency. 

• Recommended levels of vitamins and minerals. 

• Vegetable oils to provide recommended levels of essential fatty 
acids. 

• Milk-white color, milk-like consistency and pleasant aroma. 

20 Ingredients: (Pareve, ®) 85% water, 4.9% corn syrup, 2.6% sugar 

(sucrose), 2.1% soy oil, 1.9% soy protein isolate, 1.4% coconut oil, 0.15% 
calcium citrate, 0.1 1 % calcium phosphate tribasic, potassium citrate, potassium 
phosphate monobasic, potassium chloride, mono- and disglycerides, soy 
lecithin, carrageenan, ascorbic acid, L-methionine, magnesium chloride, 

25 potassium phosphate dibasic, sodium chloride, choline chloride, taurine, ferrous 

sulfate, m-inositol, alpha-tocopheryl acetate, zinc sulfate, L-carnitine, 
niacinamide, calcium pantothenate, cupric sulfate, vitamin A palmitate, 
thiamine chloride hydrochloride, riboflavin, pyridoxine hydrochloride, folic 
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acid, manganese sulfate, potassium iodide, phylloquinone, biotin, sodium 
selenite, vitamin D3 and cyanocobalamin 

B. Isomil® DF Soy Formula For Diarrhea. 

Usage: As a short-term feeding for the dietary management of diarrhea 
5 in infants and toddlers. 

Features: 

• First infant formula to contain added dietary fiber from soy fiber 
specifically for diarrhea management. 

• Clinically shown to reduce the duration of loose, watery stools 
1 0 during mild to severe diarrhea in infants. 

• Nutritionally complete to meet the nutritional needs of the infant. 

• Soy protein isolate with added L-methionine meets or exceeds an 
infant's requirement for all essential amino acids. 

• Lactose-free formulation to avoid lactose-associated diarrhea. 

1 5 • Low osmolality (240 mOsm/kg water) to reduce the risk of 

osmotic diarrhea. 

• Dual carbohydrates (corn syrup and sucrose) designed to 
enhance carbohydrate absorption and reduce the risk of exceeding the 
absorptive capacity of the damaged gut. 

20 • Meets or exceeds the vitamin and mineral levels recommended 

by the Committee on Nutrition of the American Academy of Pediatrics 
and required by the Infant Formula Act. 

• 1 .8 mg of iron (as ferrous sulfate) per 1 00 Calories to help 
prevent iron deficiency. 

25 • Vegetable oils to provide recommended levels of essential fatty 

acids. 

Ingredients: (Pareve, ©) 86% water, 4.8% corn syrup, 2.5% sugar 
(sucrose), 2.1% soy oil, 2.0% soy protein isolate, 1.4% coconut oil, 0.77% soy 
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fiber, 0.12% calcium citrate, 0.1 1 % calcium phosphate tribasic, 0.10% 
potassium citrate, potassium chloride, potassium phosphate monobasic, mono- 
and disglycerides, soy lecithin, carrageenan, magnesium chloride, ascorbic acid, 
L-methionine, potassium phosphate dibasic, sodium chloride, choline chloride, 
5 taurine, ferrous sulfate, m-inositol, alpha-tocopheryl acetate, zinc sulfate, L- 
carnitine, niacinamide, calcium pantothenate, cupric sulfate, vitamin A 
palmitate, thiamine chloride hydrochloride, riboflavin, pyridoxine 
hydrochloride, folic acid, manganese sulfate, potassium iodide, phylloquinone, 
biotin, sodium selenite, vitamin D3 and cyanocobalamin 

1 0 C. Isomil® SF Sucrose-Free Soy Formula With Iron, 

Usage: As a beverage for infants, children and adults with an allergy or 
sensitivity to cow's-milk protein or an intolerance to sucrose. A feeding for 
patients with disorders for which lactose and sucrose should be avoided. 

Features: 

1 5 • Soy protein isolate to avoid symptoms of cow's-milk-protein 

allergy or sensitivity. 

• Lactose-free formulation to avoid lactose-associated diarrhea 
(carbohydrate source is Polycose® Glucose Polymers). 

• Sucrose free for the patient who cannot tolerate sucrose. 

20 • Low osmolality ( 1 80 mOsm/kg water) to reduce risk of osmotic 

diarrhea. 

• 1 .8 mg of iron (as ferrous sulfate) per 1 00 Calories to help 
prevent iron deficiency. 

• Recommended levels of vitamins and minerals. 

25 # Vegetable oils to provide recommended levels of essential fatty 

acids. 

• Milk-white color, milk-like consistency and pleasant aroma. 

Ingredients: (Pareve, ©) 75% water, 1 1.8% hydrolized cornstarch, 4.1% 
soy oil, 4.1% soy protein isolate, 2.8% coconut oil, 1.0% modified cornstarch, 
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0.38% calcium phosphate tribasic, 0.17% potassium citrate, 0.13% potassium 
chloride, mono- and disglycerides, soy lecithin, magnesium chloride, abscorbic 
acid, L-methionine, calcium carbonate, sodium chloride, choline chloride, 
carrageenan, taurine, ferrous sulfate, m-inositol, alpha-tocopheryl acetate, zinc 
5 sulfate, L-carnitine, niacinamide, calcium pantothenate, cupric sulfate, vitamin 

A palmitate, thiamine chloride hydrochloride, riboflavin, pyridoxine 
hydrochloride, folic acid, manganese sulfate, potassium iodide, phylloquinone, 
biotin, sodium selenite, vitamin D3 and cyanocobalamin 

D. Isomil® 20 Soy Formula With Iron Ready To Feed, 
10 20Cal/floz. 

Usage: When a soy feeding is desired. 

Ingredients: (Pareve, ®) 85% water, 4.9% corn syrup, 2.6% sugar 
(sucrose), 2.1% soy oil, 1.9% soy protein isolate, 1.4% coconut oil, 0.15% 
calcium citrate, 0.1 1% calcium phosphate tribasic, potassium citrate, potassium 

1 5 phosphate monobasic, potassium chloride, mono- and disglycerides, soy 

lecithin, carrageenan, abscorbic acid, L-methionine, magnesium chloride, 
potassium phosphate dibasic, sodium chloride, choline chloride, taurine, ferrous 
sulfate, m-inositol, alpha-tocopheryl acetate, zinc sulfate, L-carnitine, 
niacinamide, calcium pantothenate, cupric sulfate, vitamin A palmitate, 

20 thiamine chloride hydrochloride, riboflavin, pyridoxine hydrochloride, folic 

acid, manganese sulfate, potassium iodide, phylloquinone, biotin, sodium 
selenite, vitamin D3 and cyanocobalamin. 

E. Similac® Infant Formula 

Usage: When an infant formula is needed: if the decision is made to 
25 discontinue breastfeeding before age 1 year, if a supplement to breastfeeding is 

needed or as a routine feeding if breastfeeding is not adopted. 
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Features: 

• Protein of appropriate quality and quantity for good growth; 
heat-denatured, which reduces the risk of milk-associated enteric blood 
loss. 

5 • Fat from a blend of vegetable oils (doubly homogenized), 

providing essential linoleic acid that is easily absorbed. 

• Carbohydrate as lactose in proportion similar to that of human 
milk. 

• Low renal solute load to minimize stress on developing organs. 

10 • Powder, Concentrated Liquid and Ready To Feed forms. 

Ingredients: (®-D) Water, nonfat milk, lactose, soy oil, coconut oil, 
mono- and diglycerides, soy lecithin, abscorbic acid, carrageenan, choline 
chloride, taurine, m-inositol, alpha-tocopheryl acetate, zinc sulfate, niacinamid, 
ferrous sulfate, calcium pantothenate, cupric sulfate, vitamin A palmitate, 
1 5 thiamine chloride hydrochloride, riboflavin, pyridoxine hydrochloride, folic 

acid, manganese sulfate, phylloquinone, biotin, sodium selenite, vitamin D 3 and 
cyanocobalamin 

F. Similac® NeoCare Premature Infant Formula With Iron 

Usage: For premature infants 1 special nutritional needs after hospital 
20 discharge. Similac NeoCare is a nutritionally complete formula developed to 

provide premature infants with extra calories, protein, vitamins and minerals 
needed to promote catch-up growth and support development. 

Features: 

• Reduces the need for caloric and vitamin supplementation. More 
25 calories (22 Cal/fl oz) then standard term formulas (20 Cal/fl oz). 

• Highly absorbed fat blend, with medium-chain triglycerides 
(MCT oil) to help meet the special digestive needs of premature infants. 

• Higher levels of protein, vitamins and minerals per 100 Calories 
to extend the nutritional support initiated in-hospital. 
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• More calcium and phosphorus for improved bone mineralization. 

Ingredients: ®-D Corn syrup solids, nonfat milk, lactose, whey protein 
concentrate, soy oil, high-oleic safflower oil, fractionated coconut oil (medium- 
chain triglycerides), coconut oil, potassium citrate, calcium phosphate tribasic, 

5 calcium carbonate, ascorbic acid, magnesium chloride, potassium chloride, 

sodium chloride, taurine, ferrous sulfate, m-inositol, choline chloride, ascorbyl 
palmitate, L-carnitine, alpha-tocopheryl acetate, zinc sulfate, niacinamide, 
mixed tocopherols, sodium citrate, calcium pantothenate, cupric sulfate, 
thiamine chloride hydrochloride, vitamin A palmitate, beta carotene, riboflavin, 

10 pyridoxine hydrochloride, folic acid, manganese sulfate, phylloquinone, biotin, 

sodium selenite, vitamin D3 and cyanocobalamin. 

G. Similac Natural Care Low-Iron Human Milk Fortifier Ready 
To Use, 24 Cal/fl oz. 

Usage: Designed to be mixed with human milk or to be fed alternatively 
1 5 with human milk to low-birth- weight infants. 

Ingredients: ^-D Water, nonfat milk, hydrolyzed cornstarch, lactose, 
fractionated coconut oil (medium-chain triglycerides), whey protein 
concentrate, soil oil, coconut oil, calcium phosphate tribasic, potassium citrate, 
magnesium chloride, sodium citrate, ascorbic acid, calcium carbonate, mono- 

20 and diglycerides, soy lecithin, carrageenan, choline chloride, m-inositol, taurine, 

niacinamide, L-carnitine, alpha tocopheryl acetate, zinc sulfate, potassium 
chloride, calcium pantothenate, ferrous sulfate, cupric sulfate, riboflavin, 
vitamin A palmitate, thiamine chloride hydrochloride, pyridoxine 
hydrochloride, biotin, folic acid, manganese sulfate, phylloquinone, vitamin D3, 

25 sodium selenite and cyanocobalamin. 

Various PUFAs of this invention can be substituted and/or added to the 
infant formulae described above and to other infant formulae known to those in 
the art.. 
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II. NUTRITIONAL FORMULATIONS 

A. ENSURE® 

Usage: ENSURE is a low-residue liquid food designed primarily as an 
oral nutritional supplement to be used with or between meals or, in appropriate 
5 amounts, as a meal replacement. ENSURE is lactose- and gluten-free, and is 

suitable for use in modified diets, including low-cholesterol diets. Although it 
is primarily an oral supplement, it can be fed by tube. 

Patient Conditions: 

• For patients on modified diets 

10 • For elderly patients at nutrition risk 

• For patients with involuntary weight loss 

• For patients recovering from illness or surgery 

• For patients who need a low-residue diet 
Ingredients: 

1 5 ®-D Water, Sugar (Sucrose), Maltodextrin (Corn), Calcium and Sodium 

Caseinates, High-Oleic Safflower Oil, Soy Protein Isolate, Soy Oil, Canola Oil, 
Potassium Citrate, Calcium Phosphate Tribasic, Sodium Citrate, Magnesium 
Chloride, Magnesium Phosphate Dibasic, Artificial Flavor, Sodium Chloride, 
Soy Lecithin, Choline Chloride, Ascorbic Acid, Carrageenan, Zinc Sulfate, 

20 Ferrous Sulfate, Alpha-Tocopheryl Acetate, Gellan Gum, Niacinamide, 
Calcium Pantothenate, Manganese Sulfate, Cupric Sulfate, Vitamin A 
Palmitate, Thiamine Chloride Hydrochloride, Pyridoxine Hydrochloride, 
Riboflavin, Folic Acid, Sodium Molybdate, Chromium Chloride, Biotin, 
Potassium Iodide, Sodium Selenate. 

25 

B. ENSURE® BARS 

Usage: ENSURE BARS are complete, balanced nutrition for 
supplemental use between or with meals. They provide a delicious, nutrient- 
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rich alternative to other snacks. ENSURE BARS contain <1 g lactose/bar, and 
Chocolate Fudge Brownie flavor is gluten-free. (Honey Graham Crunch flavor 
contains gluten.) 

Patient Conditions: 

5 • For patients who need extra calories, protein, vitamins and minerals 

• Especially useful for people who do not take in enough calories and 
nutrients 

• For people who have the ability to chew and swallow 

• Not to be used by anyone with a peanut allergy or any type of allergy to 
10 nuts. 

Ingredients: 

Honey Graham Crunch - High-Fructose Corn Syrup, Soy Protein^ 
Isolate, Brown Sugar, Honey, Maltodextrin (Corn), Crisp Rice (Milled Rice, 
Sugar [Sucrose], Salt [Sodium Chloride] and Malt), Oat Bran, Partially 
15 Hydrogenated Cottonseed and Soy Oils, Soy Polysaccharide, Glycerine, Whey 

Protein Concentrate, Polydextrose, Fructose, Calcium Caseinate, Cocoa 
Powder, Artificial Flafors, Canola Oil, High-Oleic Safflower Oil, Nonfat Dry 
Milk, Whey Powder, Soy Lecithin and Corn Oil. Manufactured in a facility that 
processes nuts. 

20 Vitamins and Minerals: 

Calcium Phosphate Tribasic, Potassium Phosphate Dibasic, Magnesium 
Oxide, Salt (Sodium Chloride), Potassium Chloride, Ascorbic Acid, Ferric 
Orthophosphate, Alpha-Tocopheryl Acetate, Niacinamide, Zinc Oxide, Calcium 
Pantothenate, Copper Gluconate, Manganese Sulfate, Riboflavin, Beta- 
25 Carotene, Pyridoxine Hydrochloride, Thiamine Mononitrate, Folic Acid, Biotin, 

Chromium Chloride, Potassium Iodide, Sodium Selenate, Sodium Molybdate, 
Phylloquinone, Vitamin D3 and Cyanocobalamin. 
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Protein: 

Honey Graham Crunch - The protein source is a blend of soy protein isolate 
and milk proteins. 

Soy protein isolate 74% 
5 Milk proteins 26% 

Fat: 

Honey Graham Crunch - The fat source is a blend of partially 
hydrogenated cottonseed and soybean, canola, high oleic safflower, and corn 
oils, and soy lecithin. 

1 0 Partially hydrogenated cottonseed and soybean oil 76% 



Canola oil 8% 

High-oleic safflower oil 8% 

Corn oil 4% 

Soy lecithin 4% 

15 Carbohydrate: 



Honey Graham Crunch - The carbohydrate source is a combination of 
high-fructose corn syrup, brown sugar, maltodextrin, honey, crisp rice, 
glycerine, soy polysaccharide, and oat bran. 



High-fructose corn syrup 24% 

20 Brown sugar 21% 

Maltodextrin 12% 

Honey \\% 

Crisp rice 9% 

Glycerine 9% 

25 Soy polysaccharide 7% 

Oat bran 7%\ 
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C. ENSURE® HIGH PROTEIN 

Usage: ENSURE HIGH PROTEIN is a concentrated, high-protein 
liquid food designed for people who require additional calories, protein, 
vitamins, and minerals in their diets. It can be used as an oral nutritional 
5 supplement with or between meals or, in appropriate amounts, as a meal 

replacement. ENSURE HIGH PROTEIN is lactose- and gluten-free, and is 
suitable for use by people recovering from general surgery or hip fractures and 
by patients at risk for pressure ulcers. 

Patient Conditions 

10 • For patients who require additional calories, protein, vitamins, and minerals, 

such as patients recovering from general surgery or hip fractures, patients at risk 
for pressure ulcers, and patients on low-cholesterol diets 

Features- 

• Low in saturated fat 

15 • Contains 6 g of total fat and < 5 mg of cholesterol per serving 

• Rich, creamy taste 

_ • Excellent source of protein, calcium, and other essential vitamins and 
minerals 

• For low-cholesterol diets 

20 • Lactose-free, easily digested 

Ingredients: 

Vanilla Supreme: -®-D Water, Sugar (Sucrose), Maltodextrin (Corn), Calcium 
and Sodium Caseinates, High-Oleic Safflower Oil, Soy Protein Isolate, Soy Oil, 
Canola Oil, Potassium Citrate, Calcium Phosphate Tribasic, Sodium Citrate, 
25 Magnesium Chloride, Magnesium Phosphate Dibasic, Artificial Flavor, Sodium 

Chloride, Soy Lecithin, Choline Chloride, Ascorbic Acid, Carrageenan, Zinc 
Sulfate, Ferrous Suffate, Alpha-Tocopheryl Acetate, Gellan Gum, Niacinamide, 
Calcium Pantothenate, Manganese Sulfate, Cupric Sulfate, Vitamin A 
Palmitate, Thiamine Chloride Hydrochloride, Pyridoxine Hydrochloride, 
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Riboflavin, Folio Acid, Sodium Motybdate, Chromium Chloride, Biotin, 
Potassium Iodide, Sodium Selenate, Phylloquinone, Vitamin D.3 and 
Cyanocobalarnin. 

Protein: 

5 The protein source is a blend of two high-biologic-value proteins: casein and 

soy. 

Sodium and calcium caseinates 85% 
Soy protein isolate 1 5% 

Fat: 

10 The fat source is a blend of three oils: high-oleic safflower, canola, and soy. 

High-oleic safflower oil 40% 

Canola oil 30% 

Soy oil 30% 

The level of fat in ENSURE HIGH PROTEIN meets American Heart 
1 5 Association (AHA) guidelines. The 6 grams of fat in ENSURE HIGH 

PROTEIN represent 24% of the total calories, with 2.6% of the fat being from 
saturated fatty acids and 7.9% from polyunsaturated fatty acids. These values 
are within the AHA guidelines of < 30% of total calories from fat, < 1 0% of the 
calories from saturated fatty acids, and < 1 0% of total calories from 
20 polyunsaturated fatty acids. 

Carbohydrate: 

ENSURE HIGH PROTEIN contains a combination of maltodextrin and 
sucrose. The mild sweetness and flavor variety (vanilla supreme, chocolate 
royal, wild berry, and banana), plus VARI-FLAVORSO® Flavor Pacs in pecan, 
25 cherry, strawberry, lemon, and orange, help to prevent flavor fatigue and aid in 

patient compliance. 

Vanilla and other nonchocolate flavors 

Sucrose 60% 
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Maltodextrin 
Chocolate 

Sucrose 
Maltodextrin 

5 

D. ENSURE ® LIGHT 

Usage: ENSURE LIGHT is a low-fat liquid food designed for use as an 
oral nutritional supplement with or between meals. ENSURE LIGHT is 
lactose- and gluten-free, and is suitable for use in modified diets, including low- 
1 0 cholesterol diets . 

Patient Conditions: 

• For normal-weight or overweight patients who need extra nutrition in a 
supplement that contains 50% less fat and 20% fewer calories than ENSURE 

• For healthy adults who don't eat right and need extra nutrition 
15 Features: 

• Low in fat and saturated fat 

• Contains 3 g of total fat per serving and < 5 mg cholesterol 

• Rich, creamy taste 

• Excellent source of calcium and other essential vitamins and minerals 
20 • For low-cholesterol diets 

• Lactose-free, easily digested 
Ingredients: 

French Vanilla: ®-D Water, Maltodextrin (Corn), Sugar (Sucrose), Calcium 
Caseinate, High-Oleic Safflower Oil, Canola Oil, Magnesium Chloride, Sodium 
25 Citrate, Potassium Citrate, Potassium Phosphate Dibasic, Magnesium Phosphate 

Dibasic, Natural and Artificial Flavor, Calcium Phosphate Tribasic, Cellulose 
Gel, Choline Chloride, Soy Lecithin, Carrageenan, Salt (Sodium Chloride), 
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Ascorbic Acid, Cellulose Gum, Ferrous Sulfate, Alpha-Tocopheryl Acetate, 
Zinc Sulfate, Niacinamide, Manganese Sulfate, Calcium Pantothenate, Cupric 
Sulfate, Thiamine Chloride Hydrochloride, Vitamin A Palmitate, Pyridoxine 
Hydrochloride, Riboflavin, Chromium Chloride, Folic Acid, Sodium 
5 Molybdate, Biotin, Potassium Iodide, Sodium Selenate, Phylloquinone, Vitamin 

D 3 and Cyanocobalamin. 

Protein: 

The protein source is calcium caseinate. 

Calcium caseinate 1 00% 

10 Fat 

The fat source is a blend of two oils: high-oleic safflower and canola. 
High-oleic safflower oil 70% 
Canola oil 30% 

The level of fat in ENSURE LIGHT meets American Heart Association 
1 5 (AHA) guidelines. The 3 grams of fat in ENSURE LIGHT represent 1 3.5% of 

the total calories, with 1.4% of the fat being from saturated fatty acids and 2.6% 
from polyunsaturated fatty acids. These values are within the AHA guidelines 
of < 30% of total calories from fat, < 1 0% of the calories from saturated fatty 
acids, and < 1 0% of total calories from polyunsaturated fatty acids. 

20 Carbohydrate 

ENSURE LIGHT contains a combination of maltodextrin and sucrose. 
The chocolate flavor contains corn syrup as well. The mild sweetness and 
flavor variety (French vanilla, chocolate supreme, strawberry swirl), plus 
VARI-FLAVORS® Flavor Pacs in pecan, cherry, strawberry, lemon, and 
25 orange, help to prevent flavor fatigue and aid in patient compliance. 

Vanilla and other nonchocolate flavors 

Sucrose 51<>/ 0 
Maltodextrin 49% 



-119- 



WO 98/46764 




PO7US98/07421 



Chocolate 

Sucrose 47.0% 

Corn Syrup 26.5% 

Maltodextrin 26.5% 

5 Vitamins and Minerals 

An 8-fl-oz serving of ENSURE LIGHT provides at least 25% of the 
RDIs for 24 key vitamins and minerals. 

Caffeine 

Chocolate flavor contains 2.1 mg caffeine/8 fl oz. 

10 

E. ENSURE PLUS® 

Usage: ENSURE PLUS is a high-calorie, low-residue liquid food for 
use when extra calories and nutrients, but a normal concentration of protein, are 
needed. It is designed primarily as an oral nutritional supplement to be used 
1 5 with or between meals or, in appropriate amounts, as a meal replacement. 

ENSURE PLUS is lactose- and gluten-free. Although it is primarily an oral 
nutritional supplement, it can be fed by tube. 

Patient Conditions: 

• For patients who require extra calories and nutrients, but a normal 
20 concentration of protein, in a limited volume 

• For patients who need to gain or maintain healthy weight 
Features 

• Rich, creamy taste 

• Good source of essential vitamins and minerals 

25 Ingredients 

Vanilla: ®-D Water, Corn Syrup, Maltodextrin (Com), Corn Oil, Sodium and 
Calcium Caseinates, Sugar (Sucrose), Soy Protein Isolate, Magnesium Chloride, 
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Potassium Citrate, Calcium Phosphate Tribasic, Soy Lecithin, Natural and 
Artificial Flavor, Sodium Citrate, Potassium Chloride, Choline Chloride, 
Ascorbic Acid, Carrageenan, Zinc Sulfate, Ferrous Sulfate, Alpha-Tocopheryl 
Acetate, Niacinamide, Calcium Pantothenate, Manganese Sulfate, Cupric 
5 Sulfate, Thiamine Chloride Hydrochloride, Pyridoxine Hydrochloride, 

Riboflavin, Vitamin A Palmitate, Folic Acid, Biotin, Chromium Chloride, 
Sodium Molybdate, Potassium Iodide, Sodium Selenite, Phylloquinone, 
Cyanocobalamin and Vitamin D 3 . 

Protein 

1 0 The protein source is a blend of two high-biologic-value proteins: casein 

and soy. 

Sodium and calcium caseinates 84% 
Soy protein isolate 1 6% 

Fat 

1 5 The fat source is corn oil. 

Corn oil 100% 
Carbohydrate 

ENSURE PLUS contains a combination of maltodextrin and sucrose. 
The mild sweetness and flavor variety (vanilla, chocolate, strawberry, coffee, 
20 buffer pecan, and eggnog), plus VARI-FLAVORS® Flavor Pacs in pecan, 

cherry, strawberry, lemon, and orange, help to prevent flavor fatigue and aid in 
patient compliance. 

Vanilla, strawberry, butter pecan, and coffee flavors 



Corn Syrup 29% 

25 Maltodextrin 38% 

Sucrose 23% 
Chocolate and eggnog flavors 

Corn Syrup 35% 
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Maltodextrin 34% 

Sucrose 30% 

Vitamins and Minerals 

An 8-fl-oz serving of ENSURE PLUS provides at least 15% of the RDIs 
5 for 25 key Vitamins and minerals. 

Caffeine 

Chocolate flavor contains 3.1 mg Caffeine/8 fl oz. Coffee flavor 
contains a trace amount of caffeine. 



1 0 F. ENSURE PLUS® HN 

Usage: ENSURE PLUS HN is a nutritionally complete high-calorie, 
high-nitrogen liquid food designed for people with higher calorie and protein 
needs or limited volume tolerance. It may be used for oral supplementation or 
for total nutritional support by tube. ENSURE PLUS HN is lactose- and gluten- 
15 free. 

Patient Conditions: 

• For patients with increased calorie and protein needs, such as following 
surgery or injury 

• For patients with limited volume tolerance and early satiety 
20 Features 

• For supplemental or total nutrition 

• For oral or tube feeding 

• 1.5CaVmL 

• High nitrogen 
25 • Calorically dense 
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Ingredients 

Vanilla: ©-D Water, Maltodextrin (Corn), Sodium and Calcium Caseinates, 
Corn Oil, Sugar (Sucrose), Soy Protein Isolate, Magnesium Chloride, Potassium 
Citrate, Calcium Phosphate Tribasic, Soy Lecithin, Natural and Artificial 
Flavor, Sodium Citrate, Choline Chloride, Ascorbic Acid, Taurine, L-Carnitine, 
Zinc Sulfate, Ferrous Sulfate, Alpha-Tocopheryl Acetate, Niacinamide, 
Carrageenan, Calcium Pantothenate, Manganese Sulfate, Cupric Sulfate, 
Thiamine Chloride Hydrochloride, Pyridoxine Hydrochloride, Riboflavin, 
Vitamin A Palmitate, Folic Acid, Biotin, Chromium Chloride, Sodium 
Molybdate, Potassium Iodide, Sodium Selenite, Phylloquinone, 
Cyanocobalamin and Vitamin D 3 . 

G. ENSURE® POWDER 

Usage: ENSURE POWDER (reconstituted with water) is a low-residue 
liquid food designed primarily as an oral nutritional supplement to be used with 
or between meals. ENSURE POWDER is lactose- and gluten-free, and is 
suitable for use in modified diets, including low-cholesterol diets. 

Patient Conditions: 

• For patients on modified diets 

• For elderly patients at nutrition risk 

• For patients recovering from illness/surgery 

• For patients who need a low-residue diet 
Features 

• Convenient, easy to mix 



25 



Low in saturated fat 



Contains 9 g of total fat and < 5 mg of cholesterol per serving 



High in vitamins and minerals 



For low-cholesterol diets 
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• Lactose-free, easily digested 

Ingredients: ®-D Corn Syrup, Maltodextrin (Corn), Sugar (Sucrose), Corn Oil, 
Sodium and Calcium Caseinates, Soy Protein Isolate, Artificial Flavor, 
Potassium Citrate, Magnesium Chloride, Sodium Citrate, Calcium Phosphate 

5 Tribasic, Potassium Chloride, Soy Lecithin, Ascorbic Acid, Choline Chloride, 

Zinc Sulfate, Ferrous Sulfate, Alpha-Tocopheryl Acetate, Niacinamide, 
Calcium Pantothenate, Manganese Sulfate, Thiamine Chloride Hydrochloride, 
Cupric Sulfate, Pyridoxine Hydrochloride, Riboflavin, Vitamin A Palmitate, 
Folic Acid, Biotin, Sodium Molybdate, Chromium Chloride, Potassium Iodide, 

10 Sodium Selenate, Phylloquinone, Vitamin D3 and Cyanocobalamin. 

Protein 

The protein source is a blend of two high-biologic-value proteins: casein 
and soy. 

Sodium and calcium caseinates 84% 
15 Soy protein isolate 16% 

Fat 

The fat source is corn oil. 

Corn oil 100% 
Carbohydrate 

20 ENSURE POWDER contains a combination of com syrup, 

maltodextrin, and sucrose. The mild sweetness of ENSURE POWDER, plus 
VARI-FLAVORS® Flavor Pacs in pecan, cherry, strawberry, lemon, and 
orange, helps to prevent flavor fatigue and aid in patient compliance. 



Vanilla 

25 Corn Syrup 35% 

Maltodextrin 35% 

Sucrose 30% 
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H. ENSURE® PUDDING 

Usage: ENSURE PUDDING is a nutrient-dense supplement providing 
balanced nutrition in a nonliquid form to be used with or between meals. It is 
appropriate for consistency-modified diets (e.g., soft, pureed, or full liquid) or 
5 for people with swallowing impairments. ENSURE PUDDING is gluten-free. 
Patient Conditions: 

• For patients on consistency-modified diets (e.g., soft, pureed, or full liquid) 

• For patients with swallowing impairments 
Features 

1 0 • Rich and creamy, good taste 

• Good source of essential vitamins and minerals Convenient-needs no 
refrigeration 

• Gluten-free 

Nutrient Profile per 5 oz: Calories 250, Protein 10.9%, Total Fat 34.9%, 
15 Carbohydrate 54.2% 

Ingredients: 

Vanilla: ©-D Nonfat Milk, Water, Sugar (Sucrose), Partially Hydrogenated 
Soybean Oil, Modified Food Starch, Magnesium Sulfate. Sodium Stearoyl 
Lactylate, Sodium Phosphate Dibasic, Artificial Flavor, Ascorbic Acid, Zinc 

20 Sulfate, Ferrous Sulfate, Alpha-Tocopheryl Acetate, Choline Chloride, 

Niacinamide, Manganese Sulfate, Calcium Pantothenate, FD&C Yellow #5, 
Potassium Citrate, Cupric Sulfate, Vitamin A Palmitate, Thiamine Chloride 
Hydrochloride, Pyridoxine Hydrochloride, Riboflavin, FD&C Yellow #6, Folic 
Acid, Biotin, Phylloquinone, Vitamin D3 and Cyanocobalamin. 

25 Protein 

The protein source is nonfat milk. 

Nonfat milk 100% 
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Fat 

The fat source is hydrogenated soybean oil. 
Hydrogenated soybean oil 100% 
Carbohydrate 

5 ENSURE PUDDING contains a combination of sucrose and modified 

food starch. The mild sweetness and flavor variety (vanilla, chocolate, 
butterscotch, and tapioca) help prevent flavor fatigue. The product contains 9.2 
grams of lactose per serving. 

Vanilla and other nonchocolate flavors 



10 Sucrose 56% 

Lactose 27% 

Modified food starch 1 7% 
Chocolate 

Sucrose 58% 

15 Lactose 26% 

Modified food starch 1 6% 



I. ENSURE® WITH FIBER 

Usage: ENSURE WITH FIBER is a fiber-containing, nutritionally 
20 complete liquid food designed for people who can benefit from increased 

dietary fiber and nutrients. ENSURE WITH FIBER is suitable for people who 
do not require a low-residue diet. It can be fed orally or by tube, and can be 
used as a nutritional supplement to a regular diet or, in appropriate amounts, as 
a meal replacement. ENSURE WITH FIBER is lactose- and gluten-free, and is 
25 suitable for use in modified diets, including low-cholesterol diets. 

Patient Conditions 

• For patients who can benefit from increased dietary fiber and nutrients 
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Features 



• New advanced formula-low in saturated fat, higher in vitamins and minerals 

• Contains 6 g of total fat and < 5 mg of cholesterol per serving 

• Rich, creamy taste 



• Excellent source of essential vitamins and minerals 

• For low-cholesterol diets 

• Lactose- and gluten-free 
Ingredients 

Vanilla: ©-D Water, Maltodextrin (Corn), Sugar (Sucrose), Sodium and 
Calcium Caseinates, Oat Fiber, High-Oleic Safflower Oil, Canola Oil, Soy 
Protein Isolate, Corn Oil, Soy Fiber, Calcium Phosphate Tribasic, Magnesium 
Chloride, Potassium Citrate, Cellulose Gel, Soy Lecithin, Potassium Phosphate 
Dibasic, Sodium Citrate, Natural and Artificial Flavors, Choline Chloride, 
Magnesium Phosphate, Ascorbic Acid, Cellulose Gum, Potassium Chloride, 
Carrageenan, Ferrous Sulfate, Alpha-Tocopheryl Acetate, Zinc Sulfate, 
Niacinamide, Manganese Sulfate, Calcium Pantothenate, Cupric Sulfate, 
Vitamin A Palmitate, Thiamine Chloride Hydrochloride, Pyridoxine 
Hydrochloride, Riboflavin, Folic Acid, Chromium Chloride, Biotin, Sodium 
Molybdate, Potassium Iodide, Sodium Selenate, Phylloquinone, Vitamin D 3 and 
Cyanocobalamin. 

Protein 

The protein source is a blend of two high-biologic-value proteins- casein 



5 



• Good source of fiber 



and soy. 



25 



Sodium and calcium caseinates 



80% 



Soy protein isolate 



20% 
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Fat 

The fat source is a blend of three oils: high-oleic safflower, canola, and 



corn. 

High-oleic safflower oil 40% 

5 Canola oil 40% 

Corn oil 20% 



The level of fat in ENSURE WITH FIBER meets American Heart 
Association (AHA) guidelines. The 6 grams of fat in ENSURE WITH FIBER 
represent 22% of the total calories, with 2.01 % of the fat being from saturated 
10 fatty acids and 6.7% from polyunsaturated fatty acids. These values are within 

the AHA guidelines of < 30% of total calories from fat, < 1 0% of the calories 
from saturated fatty acids, and < 1 0% of total calories from polyunsaturated 
fatty acids. 

Carbohydrate 

1 5 ENSURE WITH FIBER contains a combination of maltodextrin and 

sucrose. The mild sweetness and flavor variety (vanilla, chocolate, and butter 
pecan), plus VARI-FLAVORS® Flavor Pacs in pecan, cherry, strawberry, 
lemon, and orange, help to prevent flavor fatigue and aid in patient compliance. 

Vanilla and other nonchocolate flavors 



20 Maltodextrin 66% 

Sucrose 25% 

Oat Fiber 7% 

Soy Fiber 2% 
Chocolate 

25 Maltodextrin 55% 

Sucrose 36% 

Oat Fiber 7% 
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Soy Fiber 2% 

Fiber 

The fiber blend used in ENSURE WITH FIBER consists of oat fiber and 
soy polysaccharide. This blend results in approximately 4 grams of total dietary 
5 fiber per 8-fl-oz can. The ratio of insoluble to soluble fiber is 95:5. 

The various nutritional supplements described above and known to 
others of skill in the art can be substituted and/or supplemented with the PUFAs 
of this invention. 

J. Oxepa™ Nutritional Product 

10 Oxepa is low-carbohydrate, calorically dense enteral nutritional product 

designed for the dietary management of patients with or at risk for ARDS. It 
has a unique combination of ingredients, including a patented oil blend 
containing eicosapentaenoic acid (EPA from fish oil), y-Hnolenic acid (GLA 
from borage oil), and elevated antioxidant levels. 

1 5 Caloric Distribution: 

• Caloric density is high at 1 .5 Cal/mL (355 Cal/8 fl oz), to minimize the 
volume required to meet energy needs. 



The distribution of Calories in Oxepa is shown in Table 7. 



Table 7. Caloric Distribution of Oxepa 




per 8 fl oz. 


per liter 


%ofCal 


Calories 


l 355 


1,500 




Fat(g) 


22.2 


93.7 


55.2 


Carbohydrate (g) 


25 


105.5 


28.1 


Protein (g) 


14.8 


62.5 


16.7 


Water (g) 


186 


785 





20 Fat: 

• Oxepa contains 22.2 g of fat per 8-fl oz serving (93 .7 g/L). 

• The fat source is a oil blend of 3 1 .8% canola oil, 25% medium-chain 
triglycerides (MCTs), 20% borage oil, 20% fish oil, and 3.2 % soy lecithin. The 
typical fatty acid profile of Oxepa is shown in Table 8. 
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• Oxepa provides a balanced amount of polyunsaturated, monounsaturated, 
and saturated fatty acids, as shown in Table 10. 

• Medium-chain trigylcerides (MCTs) - 25% of the fat blend - aid gastric 
emptying because they are absorbed by the intestinal tract without 

5 emulsification by bile acids. 

The various fatty acid components of Oxepa™ nutritional product can 
be substituted and/or supplemented with the PUFAs of this invention. 



Table 8. Typical Fatty Acid Profile 




% Total Fatty 
Acids 


g/8 fl oz* 


g/L* 


Caproic (6:0) 


0.2 


0.04 


0.18 


Caprylic (8:0) 


14.69 


3.1 


13.07 


Capric(10:0) 


11.06 


2.33 


9.87 


Palmitic (16:0) 


5.59 


1.18 


4.98 


Palmitoleic(16:ln-7) 


1.82 


0.38 


1.62 


Stearic (18:0) 


1.84 


0.39 


1.64 


Oleic (18:ln-9) 


24.44 


5.16 


21.75 


Linoleic(18:2n-6) 


16.28 


3.44 


14.49 


a-Linolenic (18:3n-3) 


3.47 


0.73 


3.09 


Y-Linolenic (18:3n-6) 


4.82 


1.02 


4.29 


Eicosapentaenoic (20:5n- 
3) 


5.11 


1.08 


4.55 


n-3-Docosapentaenoic 
(22:5n-3) 


0.55 


0.12 


0.49 


Docosahexaenoic (22:6n- 
3) 


2.27 


0.48 


2.02 


Others 


7.55 


1.52 


6.72 



* Fatty acids equal approximately 95% of total fat. 



Table 9. Fat Profile of Oxepa. 


% of total calories from fat 


55.2 


Polyunsaturated fatty acids 


31.44 g/L 


Monounsaturated fatty acids 


25.53 g/L 


Saturated fatty acids 


32.38 g/L 


n-6 to n-3 ratio 


1.75:1 


Cholesterol 


9.49 mg/8 fl oz 
40.1 mg/L 



10 
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Carbohydrate: 

• The carbohydrate content is 25.0 g per 8-fl-oz serving (105.5 g/L). 

• The carbohydrate sources are 45% maltodextrin (a complex carbohydrate) 
and 55% sucrose (a simple sugar), both of which are readily digested and 

5 absorbed. 

• The high-fat and low-carbohydrate content of Oxepa is designed to 
minimize carbon dioxide (CO2) production. High C0 2 levels can complicate 
weaning in ventilator-dependent patients. The low level of carbohydrate also 
may be useful for those patients who have developed stress-induced 

10 hyperglycemia. 

• Oxepa is lactose-free. 

Dietary carbohydrate, the amino acids from protein, and the glycerol 
moiety of fats can be converted to glucose within the body. Throughout this 
process, the carbohydrate requirements of glucose-dependent tissues (such as 

1 5 the central nervous system and red blood cells) are met. However, a diet free of 

carbohydrates can lead to ketosis, excessive catabolism of tissue protein, and 
loss of fluid and electrolytes. These effects can be prevented by daily ingestion 
of 50 to 100 g of digestible carbohydrate, if caloric intake is adequate. The 
carbohydrate level in Oxepa is also sufficient to minimize gluconeogenesis, if 

20 energy needs are being met. 

Protein: 

• Oxepa contains 14.8 g of protein per 8-fl-oz serving (62.5 g/L). 

• The total calorie/nitrogen ratio (150:1) meets the need of stressed patients. 

• Oxepa provides enough protein to promote anabolism and the maintenance 
25 of lean body mass without precipitating respiratory problems. High protein 

intakes are a concern in patients with respiratory insufficiency. Although 
protein has little effect on C0 2 production, a high protein diet will increase 
ventilatory drive. 
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• The protein sources of Oxepa are 86.8% sodium caseinate and 13.2% 
calcium caseinate. 

• As demonstrated in Table 1 1, the amino acid profile of the protein system in 
Oxepa meets or surpasses the standard for high quality protein set by 

5 theNational Academy of Sciences. 

• Oxepa is gluten-free. 

All publications and patent applications mentioned in this specification 
are indicative of the level of skill of those skilled in the art to which this 
10 invention pertains. All publications and patent applications are herein 

incorporated by reference to the same extent as if each individual publication or 
patent application was specifically and individually indicated to be incorporated 
by reference. 

The invention now being fully described, it will be apparent to one of 
1 5 ordinary skill in the art that many changes and modifications can be made 

thereto without departing from the spirit or scope of the appended claims. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: KNUTZON, DEBORAH 
MURKER J I , PRADIP 
HUANG , YUNG-SHENG 
THURMOND, JENNIFER 
CHAUDHARY, SUNITA 
LEONARD, AMANDA 

(ii) TITLE OF INVENTION: METHODS AND COMPOSITIONS FOR SYNTHESIS 
OF LONG CHAIN POLY-UNSATURATED FATTY ACIDS IN PLANTS 

(iii) NUMBER OF SEQUENCES: 52 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: LIMBACH & LIMBACH L.L.P. 

(B) STREET: 2001 FERRY BUILDING 

(C) CITY: SAN FRANCISCO 

(D) STATE: CA 

(E) COUNTRY: USA 

(F) ZIP: 94111 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE r Microsoft Word 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/834,033 

(B) FILING DATE: ll-APR-1997 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/833,610 

(B) FILING DATE: ll-APR-1997 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: MICHAEL R. WARD 

(B) REGISTRATION NUMBER: 38,351 

(C) REFERENCE/ DOCKET NUMBER: CGAB-320 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (415) 433-4150 

(B) TELEFAX: (415) 433-8716 

(C) TELEX: N/A 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1617 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: DNA (genomic) 



10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

CGACACTCCT TCCTTCTTCT CACCCGTCCT AGTCCCCTTC AACCCCCCTC TTTGACAAAG 60 

ACAACAAACC ATGGCTGCTG CTCCCAGTGT GAGGACGTTT ACTCGGGCCG AGGTTTTGAA 120 

15 

TGCCGAGGCT CTGAATGAGG GCAAGAAGGA TGCCGAGGCA CCCTTCTTGA TGATCATCGA 180 

CAACAAGGTG TACGATGTCC GCGAGTTCGT CCCTGATCAT CCCGGTGGAA GTGTGATTCT 240 

20 CACGCACGTT GGCAAGGACG GCACTGACGT CTTTGACACT TTTCACCCCG AGGCTGCTTG 300 

GGAGACTCTT GCCAACTTTT ACGTTGGTGA TATTGACGAG AGCGACCGCG ATATCAAGAA 360 

TGATGACTTT GCGGCCGAGG TCCGCAAGCT GCGTACCTTG TTCCAGTCTC TTGGTTACTA 42 0 

25 

CGATTCTTCC AAGGCATACT ACGCCTTCAA GGTCTCGTTC AACCTCTGCA TCTGGGGTTT 4 80 

GTCGACGGTC ATTGTGGCCA AGTGGGGCCA GACCTCGACC CTCGCCAACG TGCTCTCGGC 540 

30 TGCGCTTTTG GGTCTGTTCT GGCAGCAGTG CGGATGGTTG GCTCACGACT TTTTGCATCA 600 

CCAGGTCTTC CAGGACCGTT TCTGGGGTGA TCTTTTCGGC GCCTTCTTGG GAGGTGTCTG 660 

CCAGGGCTTC TCGTCCTCGT GGTGGAAGGA CAAGCACAAC ACTCACCACG CCGCCCCCAA 72 0 

35 

CGTCCACGGC GAGGATCCCG ACATTGACAC CCACCCTCTG TTGACCTGGA GTGAGCATGC 7 80 

GTTGGAGATG TTCTCGGATG TCCCAGATGA GGAGCTGACC CGCATGTGGT CGCGTTTCAT 8 40 

40 GGTCCTGAAC CAGACCTGGT TTTACTTCCC CATTCTCTCG TTTGCCCGTC TCTCCTGGTG 900 

CCTCCAGTCC ATTCTCTTTG TGCTGCCTAA CGGTCAGGCC CACAAGCCCT CGGGCGCGCG 960 

TGTGCCCATC TCGTTGGTCG AGCAGCTGTC GCTTGCGATG CACTGGACCT GGTACCTCGC 102 0 

45 

CACCATGTTC CTGTTCATCA AGGATCCCGT CAACATGCTG GTGTACTTTT TGGTGTCGCA 1080 

GGCGGTGTGC GGAAACTTGT TGGCGATCGT GTTCTCGCTC AACCACAACG GTATGCCTGT 1140 

50 GATCTCGAAG GAGGAGGCGG TCGATATGGA TTTCTTCACG AAGCAGATCA TCACGGGTCG 12 00 

TGATGTCCAC CCGGGTCTAT TTGCCAACTG GTTCACGGGT GGATTGAACT ATCAGATCGA 12 60 

GCACCACTTG TTCCCTTCGA TGCCTCGCCA CAACTTTTCA AAGATCCAGC CTGCTGTCGA 1320 

55 

GACCCTGTGC AAAAAGTACA ATGTCCGATA CCACACCACC GGTATGATCG AGGGAACTGC 1380 

AGAGGTCTTT AGCCGTCTGA ACGAGGTCTC CAAGGCTGCC TCCAAGATGG GTAAGGCGCA 14 40 

60 GTAAAAAAAA AAACAAGGAC GTTTTTTTTC GCCAGTGCCT GTGCCTGTGC CTGCTTCCCT 1500 

TGTCAAGTCG AGCGTTTCTG GAAAGGATCG TTCAGTGCAG TATCATCATT CTCCTTTTAC 1560 
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15 



20 



30 



35 



45 



50 



CCCCCGCTCA TATCTCATTC ATTTCTCTTA TTAAACAACT TGTTCCCCCC TTCACCG 1617 



(2) INFORMATION FOR SEQ ID NO: 2: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 457 amino acids 
10 (B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

Met Ala Ala Ala Pro Ser Val Arg Thr Phe Thr Arg Ala Glu Val Leu 
1 5 10 15 

Asn Ala Glu Ala Leu Asn Glu Gly Lys Lys Asp Ala Glu Ala Pro Phe 
Z2> 20 25 30 

Leu Met lie He Asp Asn Lys Val Tyr Asp Val Arg Glu Phe Val Pro 
35 40 45 

Asp His Pro Gly Gly Ser Val He Leu Thr His Val Gly Lys Asp Gly 
50 55 60 

Thr Asp Val Phe Asp Thr Phe His Pro Glu Ala Ala Trp Glu Thr Leu 
65 7 0 75 ' 80 

Ala Asn Phe Tyr Val Gly Asp He Asp Glu Ser Asp Arg Asp He Lys 
85 90 95 

An Asn Asp Asp Phe Ala Ala Glu Val Arg Lys Leu Arg Thr Leu Phe Gin 

W 100 105 no 



Ser Leu Gly Tyr Tyr Asp Ser Ser Lys Ala Tyr Tyr Ala Phe Lys Val 
115 120 125 

Ser Phe Asn Leu Cys He Trp Gly Leu Ser Thr Val He Val Ala Lys 
130 135 140 

Trp Gly Gin Thr Ser Thr Leu Ala Asn Val Leu Ser Ala Ala Leu Leu 
145 150 155 160 

Gly Leu Phe Trp Gin Gin Cys Gly Trp Leu Ala His Asp Phe Leu His 
165 170 175 

» His Gln Vai Phe G l n Asp Arg Phe Trp Gly Asp Leu Phe Gly Ala Phe 

33 180 185 190 

Leu Gly Gly Val Cys Gin Gly Phe Ser Ser Ser Trp Trp Lys Asp Lys 
195 200 205 

60 His A f" Thr His Ala Ala Pro Asn Val His Gly Glu Asp Pro Asp 

2X0 215 220 
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lie Asp Thr His Pro Leu Leu Thr Trp Ser Glu His Ala Leu Glu Met 
225 230 235 240 

Phe Ser Asp Val Pro Asp Glu Glu Leu Thr Arg Met Trp Ser Arg Phe 
245 250 255 

Met Val Leu Asn Gin Thr Trp Phe Tyr Phe Pro lie Leu Ser Phe Ala 
260 265 270 

Arg Leu Ser Trp Cys Leu Gin Ser lie Leu Phe Val Leu Pro Asn Gly 
275 280 285 

Gin Ala His Lys Pro Ser Gly Ala Arg Val Pro lie Ser Leu Val Glu 
290 295 300 

Gin Leu Ser Leu Ala Met His Trp Thr Trp Tyr Leu Ala Thr Met Phe 
305 310 315 320 

Leu Phe lie Lys Asp Pro Val Asn Met Leu Val Tyr Phe Leu Val Ser 
325 330 335 

Gin Ala Val Cys Gly Asn Leu Leu Ala lie Val Phe Ser Leu Asn His 
340 345 350 

Asn Gly Met Pro Val lie Ser Lys Glu Glu Ala Val Asp Met Asp Phe 
355 360 365 

Phe Thr Lys Gin lie lie Thr Gly Arg Asp Val His Pro Gly Leu Phe 
370 375 380 

Ala Asn Trp Phe Thr Gly Gly Leu Asn Tyr Gin lie Glu His His Leu 
385 390 395 400 

Phe Pro Ser Met Pro Arg His Asn Phe Ser Lys lie Gin Pro Ala Val 
405 410 - 415 

Glu Thr Leu Cys Lys Lys Tyr Asn Val Arg Tyr His Thr Thr Gly Met 
420 425 430 

lie Glu Gly Thr Ala Glu Val Phe Ser Arg Leu Asn Glu Val Ser Lys 
435 440 445 

Ala Ala Ser Lys Met Gly Lys Ala Gin 
450 455 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 88 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
GTCCCCTGTC GCTGTCGGCA CACCCCATCC TCCCTCGCTC CCTCTGCGTT TGTCCTTGGC 
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CCACCGTCTC 


TCCTCCACCC 


TCCGAGACGA 


CTGCAACTGT 


AATCAGGAAC 


CGACAAATAC 


120 




ACGATTTCTT 


TTTACTCAGC 


ACCAACTCAA 


AATCCTCAAC 


CGCAACCCTT 


TTTCAGGATG 


180 


5 


GCACCTCCCA 


ACACTATCGA 


TGCCGGTTTG 


ACCCAGCGTC 


ATATCAGCAC 


CTCGGCCCCA 


240 




AACTCGGCCA 


AGCCTGCCTT 


CGAGCGCAAC 


TACCAGCTCC 


CCGAGTTCAC 


CATCAAGGAG 


300 


10 


ATCCGAGAGT 


GCATCCCTGC 


CCACTGCTTT 


GAGCGCTCCG 


GTCTCCGTGG 


TCTCTGCCAC 


360 


GTTGCCATCG 


ATCTGACTTG 


GGCGTCGCTC 


TTGTTCCTGG 


CTGCGACCCA 


GATCGACAAG 


420 




TTTGAGAATC 


CCTTGATCCG 


CTATTTGGCC 


TGGCCTGTTT 


ACTGGATCAT 


GCAGGGTATT 


480 


15 


GTCTGCACCG 


GTGTCTGGGT 


GCTGGCTCAC 


GAGTGTGGTC 


ATCAGTCCTT 


CTCGACCTCC 


540 




AAGACCCTCA 


ACAACACAGT 


TGGTTGGATC 


TTGCACTCGA 


TGCTCTTGGT 


CCCCTACCAC 


600 


20 


TCCTGGAGAA 


TCTCGCACTC 


GAAGCACCAC 


AAGGCCACTG 


GCCATATGAC 


CAAGGACCAG 


660 


GTCTTTGTGC 


CCAAGACCCG 


CTCCCAGGTT 


GGCTTGCCTC 


CCAAGGAGAA 


CGCTGCTGCT 


720 




GCCGTTCAGG 


AGGAGGACAT 


GTCCGTGCAC 


CTGGATGAGG 


AGGCTCCCAT 


TGTGACTTTG 


780 


25 


TTCTGGATGG 


TGATCCAGTT 


CTTGTTCGGA 


TGGCCCGCGT 


ACCTGATTAT 


GAACGCCTCT 


840 




GGCCAAGACT 


ACGGCCGCTG 


GACCTCGCAC 


TTCCACACGT 


ACTCGCCCAT 


CTTTGAGCCC 


900 


30 


CGCAACTTTT 


TCGACATTAT 


TATCTCGGAC 


CTCGGTGTGT 


TGGCTGCCCT 


CGGTGCCCTG 


960 




ATCTATGCCT 


CCATGCAGTT 


GTCGCTCTTG 


ACCGTCACCA 


AGTACTATAT 


TGTCCCCTAC 


1020 




CTCTTTGTCA 


ACTTTTGGTT 


GGTCCTGATC 


ACCTTCTTGC 


AGCACACCGA 


TCCCAAGCTG 


1080 


35 


CCCCATTACC 


GCGAGGGTGC 


CTGGAATTTC 


CAGCGTGGAG 


CTCTTTGCAC 


CGTTGACCGC 


1140 




TCGTTTGGCA 


AGTTCTTGGA 


CCATATGTTC 


CACGGCATTG 


TCCACACCCA 


TGTGGCCCAT 


1200 


40 


CACTTGTTCT 


CGCAAATGCC 


GTTCTACCAT 


GCTGAGGAAG 


CTACCTATCA 


TCTCAAGAAA 


1260 




CTGCTGGGAG 


AGTACTATGT 


GTACGACCCA 


TCCCCGATCG 


TCGTTGCGGT 


CTGGAGGTCG 


1320 




TTCCGTGAGT 


GCCGATTCGT 


GGAGGATCAG 


GGAGACGTGG 


TCTTTTTCAA 


GAAGTAAAAA 


1380 


45 


AAAAGACAAT 


GGACCACACA 


CAACCTTGTC 


TCTACAGACC 


TACGTATCAT 


GTAGCCATAC 


1440 




CACTTCATAA 


AAGAACATGA 


GCTCTAGAGG 


CGTGTCATTC 


GCGCCTCC 




1488 



^ (2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 399 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 
55 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

60 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
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10 



Met Ala Pro Pro Asn Thr He Asp Ala Gly Leu Thr Gin Arg His He 
15 10 15 

Ser Thr Ser Ala Pro Asn Ser Ala Lys Pro Ala Phe Glu Arg Asn Tyr 
20 25 30 

Gin Leu Pro Glu Phe Thr He Lys Glu He Arg Glu Cys He Pro Ala 
35 40 45 

His Cys Phe Glu Arg Ser Gly Leu Arg Gly Leu Cys His Val Ala He 
50 55 60 



15 



Asp Leu Thr Trp Ala Ser Leu Leu Phe Leu Ala Ala Thr Gin He Asp 
65 70 75 80 



Lys Phe Glu Asn Pro Leu He Arg Tyr Leu Ala Trp Pro Val Tyr Trp 
85 90 95 



20 



He Met Gin Gly He Val Cys Thr Gly Val Trp Val Leu Ala His Glu 
100 105 110 



25 



Cys Gly His Gin Ser Phe Ser Thr Ser Lys Thr Leu Asn Asn Thr Val 
115 120 125 

Gly Trp He Leu His Ser Met Leu Leu Val Pro Tyr His Ser Trp Arg 
130 135 140 



30 



He Ser His Ser Lys His His Lys Ala Thr Gly His Met Thr Lys Asp 
145 150 155 160 



Gin Val Phe Val Pro Lys Thr Arg Ser Gin Val Gly Leu Pro Pro Lys 
165 170 ' 175 



35 



Glu Asn Ala Ala Ala Ala Val Gin Glu Glu Asp Met Ser Val His Leu 
180 185 190 



40 



Asp Glu Glu Ala Pro He Val Thr Leu Phe Trp Met Val He Gin Phe 
195 200 205 

Leu Phe Gly Trp Pro Ala Tyr Leu He Met Asn Ala Ser Gly Gin Asp 
210 215 220 



45 



Tyr Gly Arg Trp Thr Ser His Phe His Thr Tyr Ser Pro He Phe Glu 
225 230 235 240 



50 



Pro Arg Asn Phe Phe Asp He He He Ser Asp Leu Gly Val Leu Ala 
245 250 255 

Ala Leu Gly Ala Leu He Tyr Ala Ser Met Gin Leu Ser Leu Leu Thr 
260 265 270 



55 



Val Thr Lys Tyr Tyr He Val Pro Tyr Leu Phe Val Asn Phe Trp Leu 
275 280 285 

Val Leu lie Thr Phe Leu Gin His Thr Asp Pro Lys Leu Pro His Tyr 
290 295 300 



60 



Arg Glu Gly Ala Trp Asn Phe Gin Arg Gly Ala Leu Cys Thr Val Asp 
305 310 315 320 
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Arg Ser Phe Gly Lys Phe Leu Asp His Met Phe His Gly He Val His 
325 330 " 335 

Thr His Val Ala His His Leu Phe Ser Gin Met Pro Phe Tyr His Ala 
5 340 345 350 

Glu Glu Ala Thr Tyr His Leu Lys Lys Leu Leu Gly Glu Tyr Tyr Val 
355 360 365 

10 T yr Asp Pro Ser Pro He Val Val Ala Val Trp Arg Ser Phe Arg Glu 

370 375 380 

Cys Arg Phe Val Glu Asp Gin Gly Asp Val Val Phe Phe Lys Lys 
385 390 395 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1483 base pairs 
20 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 





GCTTCCTCCA 


GTTCATCCTC 


CATTTCGCCA 


CCTGCATTCT 


TTACGACCGT 


TAAGCAAGAT 


60 




GGGAACGGAC 


CAAGGAAAAA 


CCTTCACCTG 


GGAAGAGCTG 


GCGGCCCATA 


ACACCAAGGA 


120 


35 


CGACCTACTC 


TTGGCCATCC 


GCGGCAGGGT 


GTACGATGTC 


ACAAAGTTCT 


TGAGCCGCCA 


180 




-TCCTGGTGGA 


GTGGACACTC 


TCCTGCTCGG 


AGCTGGCCGA 


GATGTTACTC 


CGGTCTTTGA 


240 


40 


GATGTATCAC 


GCGTTTGGGG 


CTGCAGATGC 


CATTATGAAG 


AAGTACTATG 


TCGGTACACT 


300 


GGTCTCGAAT 


GAGCTGCCCA 


TCTTCCCGGA 


GCCAACGGTG 


TTCCACAAAA 


CCATCAAGAC 


360 




GAGAGTCGAG 


GGCTACTTTA 


CGGATCGGAA 


CATTGATCCC 


AAGAATAGAC 


CAGAGATCTG 


420 


45 


GGGACGATAC 


GCTCTTATCT 


TTGGATCCTT 


GATCGCTTCC 


TACTACGCGC 


AGCTCTTTGT 


480 




GCCTTTCGTT 


GTCGAACGCA 


CATGGCTTCA 


GGTGGTGTTT 


GCAATCATCA 


TGGGATTTGC 


540 


50 


GTGCGCACAA 


GTCGGACTCA 


ACCCTCTTCA 


TGATGCGTCT 


CACTTTTCAG 


TGACCCACAA 


600 




CCCCACTGTC 


TGGAAGATTC 


TGGGAGCCAC 


GCACGACTTT 


TTCAACGGAG 


CATCGTACCT 


660 




GGTGTGGATG 


TACCAACATA 


TGCTCGGCCA 


TCACCCCTAC 


ACCAACATTG 


CTGGAGCAGA 


720 


55 


TCCCGACGTG 


TCGACGTCTG 


AGCCCGATGT 


TCGTCGTATC 


AAGCCCAACC 


AAAAGTGGTT 


780 




TGTCAACCAC 


ATCAACCAGC 


ACATGTTTGT 


TCCTTTCCTG 


TACGGACTGC 


TGGCGTTCAA 


840 


60 


GGTGCGCATT 


CAGGACATCA 


ACATTTTGTA 


CTTTGTCAAG 


ACCAATGACG 


CTATTCGTGT 


900 




CAATCCCATC 


TCGACATGGC 


ACACTGTGAT 


GTTCTGGGGC 


GGCAAGGCTT 


TCTTTGTCTG 


960 
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GTATCGCCTG ATTGTTCCCC TGCAGTATCT GCCCCTGGGC AAGGTGCTGC TCTTGTTCAC 1020 

GGTCGCGGAC ATGGTGTCGT CTTACTGGCT GGCGCTGACC TTCCAGGCGA ACCACGTTGT 1080 

TGAGGAAGTT CAGTGGCCGT TGCCTGACGA GAACGGGATC ATCCAAAAGG ACTGGGCAGC 1140 

TATGCAGGTC GAGACTACGC AGGATTACGC ACACGATTCG CACCTCTGGA CCAGCATCAC 1200 

TGGCAGCTTG AACTACCAGG CTGTGCACCA TCTGTTCCCC AACGTGTCGC AGCACCATTA 12 60 

TCCCGATATT CTGGCCATCA TCAAGAACAC CTGCAGCGAG TACAAGGTTC CATACCTTGT 1320 

CAAGGATACG TTTTGGCAAG CATTTGCTTC ACATTTGGAG CACTTGCGTG TTCTTGGACT 1380 

CCGTCCCAAG GAAGAGTAGA AGAAAAAAAG CGCCGAATGA AGTATTGCCC CCTTTTTCTC 14 40 

CAAGAATGGC AAAAG GAG AT CAAGTGGACA TTCTCTATGA AGA 148 3 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 6 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS: not relevant 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Gly Thr Asp Gin Gly Lys Thr Phe Thr Trp Glu Glu Leu Ala Ala 
15 10 15 

His Asn Thr Lys Asp Asp Leu Leu Leu Ala lie Arg Gly Arg Val Tyr 
20 25 30 

Asp Val Thr Lys Phe Leu Ser Arg His Pro Gly Gly Val Asp Thr Leu 
35 40 45 

Leu Leu Gly Ala Gly Arg Asp Val Thr Pro Val Phe Glu Met Tyr His 
50 55 60 

Ala Phe Gly Ala Ala Asp Ala lie Met Lys Lys Tyr Tyr Val Gly Thr 
65 - 70 75 80 

Leu Val Ser Asn Glu Leu Pro lie Phe Pro Glu Pro Thr Val Phe His 
85 90 95 

Lys Thr lie Lys Thr Arg Val Glu Gly Tyr Phe Thr Asp Arg Asn He 
100 105 110 

Asp Pro Lys Asn Arg Pro Glu He Trp Gly Arg Tyr Ala Leu He Phe 
115 120 125 

Gly Ser Leu He Ala Ser Tyr Tyr Ala Gin Leu Phe Val Pro Phe Val 
130 135 140 

Val Glu Arg Thr Trp Leu Gin Val Val Phe Ala He He Met Gly Phe 
145 150 155 160 
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10 



25 



40 



55 



60 



Ala Cys Ala Gin Val Gly Leu Asn Pro Leu His Asp Ala Ser His Phe 
165 170 175 

) 

Ser Val Thr His Asn Pro Thr Val Trp Lys lie Leu Gly Ala Thr His 
180 185 190 

Asp Phe Phe Asn Gly Ala Ser Tyr Leu Val Trp Met Tyr Gin His Met 
195 200 205 

Leu Gly His His Pro Tyr Thr Asn He Ala Gly Ala Asp Pro Asp Val 
210 215 220 



Ser Thr Ser Glu Pro Asp Val Arg Arg He Lys Pro Asn Gin Lys Trp 

15 225 230 235 240 

Phe Val Asn His He Asn Gin His Met Phe Val Pro Phe Leu Tyr Gly 

245 250 255 

20 Leu Leu Ala Phe Lys Val Arg He Gin Asp He Asn He Leu Tyr Phe 

260 265 270 



Val Lys Thr Asn Asp Ala He Arg Val Asn Pro He Ser Thr Trp His 
275 280 285 

Thr Val Met Phe Trp Gly Gly Lys Ala Phe Phe Val Trp Tyr Arg Leu 
290 295 300 



He Val Pro Leu Gin Tyr Leu Pro Leu Gly Lys Val Leu Leu Leu Phe 

30 305 310 315 320 

Thr Val Ala Asp Met Val Ser Ser Tyr Trp Leu Ala Leu Thr Phe Gin 

325 330 335 

35 Ala Asn His Val Val Glu Glu Val Gin Trp Pro Leu Pro Asp Glu Asn 

340 345 350 



Gly He He Gin Lys Asp Trp Ala Ala Met Gin Val Glu Thr Thr Gin 
355 360 365 

Asp Tyr Ala His Asp Ser His Leu Trp Thr Ser He Thr Gly Ser Leu 
370 375 380 



A sn Tyr Gin Ala Val His His Leu Phe Pro Asn Val Ser Gin His His 

45 385 390 395 400 

Tyr Pro Asp He Leu Ala He He Lys Asn Thr Cys Ser Glu Tyr Lys 

405 410 415 

50 Val Tyr Leu Val Lys Asp Thr Phe Trp Gin Ala Phe Ala Ser His 

420 425 430 



Leu Glu His Leu Arg Val Leu Gly Leu Arg Pro Lys Glu Glu 
435 440 445 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 355 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: 

Glu Val Arg Lys Leu Arg Thr Leu Phe Gin Ser Leu Gly Tyr Tyr Asp 

10 1 5 10 15 

Ser Ser Lys Ala Tyr Tyr Ala Phe Lys Val Ser Phe Asn Leu Cys lie 

20 25 30 

15 Trp Gly Leu Ser Thr Val He Val Ala Lys Trp Gly Gin Thr Ser Thr 

35 40 45 



20 



35 



50 



Leu Ala Asn Val Leu Ser Ala Ala Leu Leu Gly Leu Phe Trp Gin Gin 

50 55 60 

Cys Gly Trp Leu Ala His Asp Phe Leu His His Gin Val Phe Gin Asp 
65 70 75 80 



Arg Phe Trp Gly Asp Leu Phe Gly Ala Phe Leu Gly Gly Val Cys Gin 

25 85 90 95 

Gly Phe Ser Ser Ser Trp Trp Lys Asp Lys His Asn Thr His His Ala 

100 105 " 110 

30 Ala Pro Asn Val His Gly Glu Asp Pro Asp He Asp Thr His Pro Leu 

115 120 125 



Leu Thr Trp Ser Glu His Ala Leu Glu Met Phe Ser Asp Val Pro Asp 
130 135 140 

Glu Glu Leu Thr Arg Met Trp Ser Arg Phe Met Val Leu Asn Gin Thr 

145 150 155 160 



Trp Phe Tyr Phe Pro He Leu Ser Phe Ala Arg Leu Ser Trp Cys Leu 
40 165 170 175 

Gin Ser He Leu Phe Val Leu Pro Asn Gly Gin Ala His Lys Pro Ser 
180 185 ~ 190 

45 Gly Ala Arg Val Pro He Ser Leu Val Glu Gin Leu Ser Leu Ala Met 

195 200 205 



His Trp Thr Trp Tyr Leu Ala Thr Met Phe Leu Phe He Lys Asp Pro 

210 215 220 

Val Asn Met Leu Val Tyr Phe Leu Val Ser Gin Ala Val Cys Gly Asn 

225 230 235 240 



Leu Leu Ala He Val Phe Ser Leu Asn His Asn Gly Met Pro Val He 
55 245 250 255 

Ser Lys Glu Glu Ala Val Asp Met Asp Phe Phe Thr Lys Gin He He 
260 265 270 

60 Thr Gly Arg Asp Val His Pro Gly Leu Phe Ala Asn Trp Phe Thr Gly 

275 280 285 
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Gly Leu Asn Tyr Gin He Glu His His Leu Phe Pro Ser Met Pro Arg 
290 295 300 

His Asn Phe Ser Lys He Gin Pro Ala Val Glu Thr Leu Cys Lys Lys 
5 305 310 315 320 

Tyr Asn Val Arg Tyr His Thr Thr Gly Met He Glu Gly Thr Ala Glu 
325 330 335 

10 Val Pne s ^r Arg Leu Asn Glu Val Ser Lys Ala Ala Ser Lys Met Gly 

340 345 350 



15 



25 



30 



45 



Lys Ala Gin 
355 

(2) INFORMATION FOR SEQ ID NO: 8: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104 amino acids 
20 (B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 



Val Thr Leu Tyr Thr Leu Ala Phe Val Ala Ala Asn Ser Leu Gly Val 
15 10 15 

Leu Gly Val Leu Ala Cys Pro Ser Val Xaa Pro His Gin He Ala 

35 20 25 30 

Ala Gly Leu Leu Gly Leu Leu Trp He Gin Ser Ala Tyr He Gly Xaa 
35 40 45 

40 As P Se r Gly His Tyr Val He Met Ser Asn Lys Ser Asn Asn Xaa Phe 

50 55 60 

Ala Gin Leu Leu Ser Gly Asn Cys Leu Thr Gly He He Ala Trp Trp 
65 70 75 80 

Lys Trp Thr His Asn Ala His His Leu Ala Cys Asn Ser Leu Asp Tyr 
85 90 95 

Gly Pro Asn Leu Gin His He Pro 
50 100 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 252 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

60 (ii) MOLECULE TYPE: peptide 
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10 

15 

20 

25 

30 

35 

40 

45 

50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

Gly Val Leu Tyr Gly Val Leu Ala Cys Thr Ser Val Phe Ala His Gin 
15 10 15 

lie Ala Ala Ala Leu Leu Gly Leu Leu Trp lie Gin Ser Ala Tyr lie 
20 25 30 

Gly His Asp Ser Gly His Tyr Val lie Met Ser Asn Lys Ser Tyr Asn 
35 40 45 

Arg Phe Ala Gin Leu Leu Ser Gly Asn Cys Leu Thr Gly lie Ser lie 
50 55 60 

Ala Trp Trp Lys Trp Thr His Asn Ala His His Leu Ala Cys Asn Ser 
65 70 75 80 

Leu Asp Tyr Asp Pro Asp Leu Gin His lie Pro Val Phe Ala Val Ser 
85 90 95 

Thr Lys Phe Phe Ser Ser Leu Thr Ser Arg Phe Tyr Asp Arg Lys Leu 
100 105 110 

Thr Phe Gly Pro Val Ala Arg Phe Leu Val Ser Tyr Gin His Phe Thr 
115 120 125 

Tyr Tyr Pro Val Asn Cys Phe Gly Arg lie Asn Leu Phe lie Gin Thr 
130 135 140 

Phe Leu Leu Leu Phe Ser Lys Arg Glu Val Pro Asp Arg Ala Leu Asn 
145 150 155 160 

Phe Ala Gly lie Leu Val Phe Trp Thr Trp Phe Pro Leu Leu Val Ser 
165 170 175 

Cys Leu Pro Asn Trp Pro Glu Arg Phe Phe Phe Val Phe Thr Ser Phe 
180 185 190 

Thr Val Thr Ala Leu Gin His lie Gin Phe Thr Leu Asn His Phe Ala 
195 200 205 

Ala Asp Val Tyr Val Gly Pro Pro Thr Gly Ser Asp Trp Phe Glu Lys 
210 215 220 

Gin Ala Ala Gly Thr lie Asp lie Ser Cys Arg Ser Tyr Met Asp Trp 
225 230 235 240 

Phe Phe Gly Gly Leu Gin Phe Gin Leu Glu His His 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 125 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



245 



250 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Gly Xaa Xaa Asn Phe Ala Gly He Leu Val Phe Trp Thr Trp Phe Pro 
1 5 10 15 

1A Leu Leu Val Ser c ys Leu Pro Asn Trp Pro Glu Arg Phe Xaa Phe Val 

10 20 25 30 



15 



20 



45 



50 



60 



Phe Thr Gly Phe Thr Val Thr Ala Leu Gin His He Gin Phe Thr Leu 
35 40 45 

Asn His Phe Ala Ala Asp Val Tyr Val Gly Pro Pro Thr Gly Ser Asp 
50 55 60 

Trp Phe Glu Lys Gin Ala Ala Gly Thr He Asp He Ser Cys Arg Ser 
65 70 75 "* 80 

Tyr Met Asp Trp Phe Phe Cys Gly Leu Gin Phe Gin Leu Glu His His 
85 90 95 



Leu phe p ro Arg Leu Pro Arg Cys His Leu Arg Lys Val Ser Pro Val 
25 100 105 * HO 

Gly Gin Arg Gly Phe Gin Arg Lys Xaa Asn Leu Ser Xaa 
115 120 125 

30 (2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 131 amino acids 

(B) TYPE: amino acid 

35 (C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 



Pro Ala Thr Glu Val Gly Gly Leu Ala Trp Met He Thr Phe Tyr Val 
1 5 10 15 

Arg Phe Phe Leu Thr Tyr Val Pro Leu Leu Gly Leu Lys Ala Phe Leu 
20 25 30 

Gly Leu Phe Phe He Val Arg Phe Leu Glu Ser Asn Trp Phe Val Trp 
35 40 45 

Val Thr Gln Me t Asn His He Pro Met His lie Asp His Asp Arg Asn 
55 50 55 60 



Met Asp Trp Val Ser Thr Gin Leu Gin Ala Thr Cys Asn Val His Lys 

65 7 0 75 80 

Ser Ala Phe Asn Asp Trp Phe Ser Gly His Leu Asn Phe Gin He Glu 
85 90 95 
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His His Leu Phe Pro Thr Met Pro Arg His Asn Tyr His Xaa Val Ala 
100 105 110 

Pro Leu Val Gin Ser Leu Cys Ala Lys His Gly lie Glu Tyr Gin Ser 
115 120 125 

Lys Pro Leu 
130 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 87 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Cys Ser Pro Lys Ser Ser Pro Thr Arg Asn Met Thr Pro Ser Pro Phe 
15 10 15 

lie Asp Trp Leu Trp Gly Gly Leu Asn Tyr Gin lie Glu His His Leu 
20 25 30 

Phe Pro Thr Met Pro Arg Cys Asn Leu Asn Arg Cys Met Lys Tyr Val 
35 40 45 

Lys Glu Trp Cys Ala Glu Asn Asn Leu Pro Tyr Leu Val Asp Asp Tyr 
50 55 60 

Phe Val Gly Tyr Asn Leu Asn Leu Gin Gin Leu Lys Asn Met Ala Glu 
65 70 75 80 

Leu Val Gin Ala Lys Ala Ala 
85 

(2) INFORMATION FOR SEQ ID NO:13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 3 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Arg His Glu Ala Ala Arg Gly Gly Thr Arg Leu Ala Tyr Met Leu Val 
15 10 15 

Cys Met Gin Trp Thr Asp Leu Leu Trp Ala Ala Ser Phe Tyr Ser Arg 
20 25 30 
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10 



20 



30 



35 



40 



50 



55 



Phe Phe Leu Ser Tyr Ser Pro Phe Tyr Gly Ala Thr Gly Thr Leu Leu 
35 40 45 

Leu Phe Val Ala Val Arg Val Leu Glu Ser His Trp Phe Val Trp lie 
50 55 60 

Thr Gin Met Asn His lie Pro Lys Glu He Gly His Glu Lys His Arg 
65 70 75 80 

Asp Trp Ala Ser Ser Gin Leu Ala Ala Thr Cys Asn Val Glu Pro Ser 
85 90 " 95 



Leu ph e He Asp Trp Phe Ser Gly His Leu Asn Phe Gin He Glu His 
15 100 105 110 



His Leu Phe Pro Thr Met Thr Arg His Asn Tyr Arg Xaa Val Ala Pro 
115 120 125 

Leu Val Lys Ala Phe Cys Ala Lys His Gly Leu His Tyr Glu Val 
130 135 140 

(2) INFORMATION FOR SEQ ID NO: 14: 



25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 186 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Leu His His Thr Tyr Thr Asn lie Ala Gly Ala Asp Pro Asp Val Ser 
1 5 10 15 

Thr Ser Glu Pro Asp Val Arg Arg He Lys Pro Asn Gin Lys Trp Phe 
20 25 30 

Val A sn His He Asn Gin His Met Phe Val Pro Phe Leu Tyr Gly Leu 
45 35 40 45 

Leu Ala Phe Lys Val Arg He Gin Asp He Asn He Leu Tyr Phe Val 
50 55 60 

Lys Thr Asn Asp Ala He Arg Val Asn Pro He Ser Thr Trp His Thr 
65 70 75 80 

Val Met Phe Trp Gly Gly Lys Ala Phe Phe Val Trp Tyr Arg Leu He 
8 5 90 95 

Val Pro Leu Gin Tyr Leu Pro Leu Gly Lys Val Leu Leu Leu Phe Thr 
100 105 110 

Val Ala Asp Met Val Ser Ser Tyr Trp Leu Ala Leu Thr Phe Gin Ala 
60 115 120 125 
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Asn Tyr Val Val Glu Glu Val Gin Trp Pro Leu Pro Asp Glu Asn Gly 
130 135 140 

He He Gin Lys Asp Trp Ala Ala Met Gin Val Glu Thr Thr Gin Asp 
145 150 155 160 

Tyr Ala His Asp Ser His Leu Trp Thr Ser He Thr Gly Ser Leu Asn 
165 170 175 

Tyr Gin Xaa Val His His Leu Phe Pro His 
180 185 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

His Xaa Xaa His His 
1 5 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Met Ala Ala Gin He Lys Lys Tyr He Thr Ser Asp Glu Leu Lys Asn 
15 10 15 

His Asp Lys Pro Gly Asp Leu Trp He Ser He Gin Gly Lys Ala Tyr 
20 25 30 

Asp Val Ser Asp Trp Val Lys Asp His Pro Gly Gly Ser Phe Pro Leu 
35 40 45 

Lys Ser Leu Ala Gly Gin Glu Val Thr Asp Ala Phe Val Ala Phe His 

50 55 60 

Pro Ala Ser Thr Trp Lys Asn Leu Asp Lys Phe Phe Thr Gly Tyr Tyr 
65 70 75 80 

Leu Lys Asp Tyr Ser Val Ser Glu Val Ser Lys Val Tyr Arg Lys Leu 
85 90 95 
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10 



25 



35 



40 



50 



55 



60 



Val Phe Glu Phe Ser Lys Met Gly Leu Tyr Asp Lys Lys Gly His lie 
100 105 " 110 

Met Phe Ala Thr Leu Cys Phe lie Ala Met Leu Phe Ala Met Ser Val 
115 120 125 

Tyr Gly Val Leu Phe Cys Glu Gly Val Leu Val His Leu Phe Ser Gly 
130 135 140 

Cys Leu Met Gly Phe Leu Trp lie Gin Ser Gly Trp lie Gly His Asp 
145 150 155 160 



Ala Gly His Tyr Met Val Val Ser Asp Ser Arg Leu Asn Lys Phe Met 
15 165 170 175 

Gly lie Phe Ala Ala Asn Cys Leu Ser Gly lie Ser lie Gly Trp Trp 
180 185 190 

20 Lys Trp Asn His Asn Ala His His lie Ala Cys Asn Ser Leu Glu Tyr 

195 200 205 



Asp Pro Asp Leu Gin Tyr He Pro Phe Leu Val Val Ser Ser Lys Phe 

210 215 220 

Phe Gly Ser Leu Thr Ser His Phe Tyr Glu Lys Arg Leu Thr Phe Asp 

225 230 235 240 



Ser Leu Ser Arg Phe Phe Val Ser Tyr Gin His Trp Thr Phe Tyr Pro 
30 245 250 255 



He Met Cys Ala Ala Arg Leu Asn Met Tyr Val Gin Ser Leu He Met 
260 265 270 

Leu Leu Thr Lys Arg Asn Val Ser Tyr Arg Ala Gin Glu Leu Leu Gly 
275 280 285 

Cys Leu Val Phe Ser He Trp Tyr Pro Leu Leu Val Ser Cys Leu Pro 
290 295 300 

Asn Trp Gly Glu Arg He Met Phe Val He Ala Ser Leu Ser Val Thr 
305 310 315 320 



Gly Met Gin Gin Val Gin Phe Ser Leu Asn His Phe Ser Ser Ser Val 
45 325 330 - 335 



Tyr Val -Gly Lys Pro Lys Gly Asn Asn Trp Phe Glu Lys Gin Thr Asp 
340 345 350 

Gly Thr Leu Asp He Ser Cys Pro Pro Trp Met Asp Trp Phe His Gly 
355 360 365 

Gly Leu Gin Phe Gin He Glu His His Leu Phe Pro Lys Met Pro Arg 

370 375 380 

Cys Asn Leu Arg Lys He Ser Pro Tyr Val He Glu Leu Cys Lys Lys 
385 390 395 ' 400 

His Asn Leu Pro Tyr Asn Tyr Ala Ser Phe Ser Lys Ala Asn Glu Met 
405 410 415 
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Thr Leu Arg Thr Leu Arg Asn Thr Ala Leu Gin Ala Arg Asp lie Thr 
420 425 430 

Lys Pro Leu Pro Lys Asn Leu Val Trp Glu Ala Leu His Thr 
5 435 440 445 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 359 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

15 (ii) MOLECULE TYPE: peptide 



20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Met Leu Thr Ala Glu Arg He Lys Phe Thr Gin Lys Arg Gly Phe Arg 
15 10 15 

25 Arg Val Leu Asn Gin Arg Val Asp Ala Tyr Phe Ala Glu His Gly Leu 

20 25 30 



30 



40 



45 



55 



60 



Thr Gin Arg Asp Asn Pro Ser Met Tyr Leu Lys Thr Leu He He Val 
35 4 0 45 

Leu Trp Leu Phe Ser Ala Trp Ala Phe Val Leu Phe Ala Pro Val He 
50 55 60 



Phe Pro Val Arg Leu Leu Gly Cys Met Val Leu Ala He Ala Leu Ala 
35 65 70 75 80 



Ala Phe Ser Phe Asn Val Gly His Asp Ala Asn His Asn Ala Tyr Ser 
85 90 95 

Ser Asn Pro His He Asn Arg Val Leu Gly Met Thr Tyr Asp Phe Val 
100 105 110 

Gly Leu Ser Ser Phe Leu Trp Arg Tyr Arg His Asn Tyr Leu His His 
115 120 125 

Thr Tyr Thr Asn He Leu Gly His Asp Val Glu He His Gly Asp Gly 
130 135 140 



Ala Val Arg Met Ser Pro Glu Gin Glu His Val Gly He Tyr Arg Phe 
50 145 150 155 160 



Gin Gin Phe Tyr He Trp Gly Leu Tyr Leu Phe lie Pro Phe Tyr Trp 

165 170 175 

Phe Leu Tyr Asp Val Tyr Leu Val Leu Asn Lys Gly Lys Tyr His Asp 

180 185 ^ " 190 

His Lys He Pro Pro Phe Gin Pro Leu Glu Leu Ala Ser Leu Leu Gly 

195 200 205 

He Lys Leu Leu Trp Leu Gly Tyr Val Phe Gly Leu Pro Leu Ala Leu 

210 215 220 
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Gly Phe Ser He Pro Glu Val Leu lie Gly Ala Ser Val Thr Tyr Met 
225 230 235 240 

Thr Tyr Gly He Val Val Cys Thr He Phe Met Leu Ala His Val Leu 
245 250 255 

Glu Ser Thr Glu Phe Leu Thr Pro Asp Gly Glu Ser Gly Ala He Asp 
260 265 ~ 270 

Asp Glu Trp Ala He Cys Gin He Arg Thr Thr Ala Asn Phe Ala Thr 
275 280 285 



Asn Asn Pro Phe Trp Asn Trp Phe Cys Gly Gly Leu Asn His Gin Val 
15 290 295 300 



Thr His His Leu Phe Pro Asn lie Cys His He His Tyr Pro Gin Leu 
305 310 315 " 320 

Glu Asn He He Lys Asp Val Cys Gin Glu Phe Gly Val Glu Tyr Lys 
325 330 335 

Val Tyr Pro Thr Phe Lys Ala Ala He Ala Ser Asn Tyr Arg Trp Leu 
340 345 350 

Glu Ala Met Gly Lys Ala Ser 
355 

(2) INFORMATION FOR SEQ ID NO: 18: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 365 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 
35 (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 



Ar Met Thr Ser Thr Thr Ser Lys Val Thr Phe Gly Lys Ser He Gly Phe 

45 1 5 10 15 



Arg Lys Glu Leu Asn Arg Arg Val Asn Ala Tyr Leu Glu Ala Glu Asn 
20 25 " 30 

He Ser Pro Arg Asp Asn Pro Pro Met Tyr Leu Lys Thr Ala He He 
35 40 45 

Leu Ala Trp Val Val Ser Ala Trp Thr Phe Val Val Phe Gly Pro Asp 
50 55 60 

Val Leu Trp Met Lys Leu Leu Gly Cys He Val Leu Gly Phe Gly Val 
65 70 75 80 

Ser Ala Val Gly Phe Asn He Ser His Asp Gly Asn His Gly Gly Tyr 
85 90 ' 95 
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Ser Lys Tyr Gin Trp Val Asn Tyr Leu Ser Gly Leu Thr His Asp Ala 
100 105 110 

lie Gly Val Ser Ser Tyr Leu Trp Lys Phe Arg His Asn Val Leu His 
115 120 125 

His Thr Tyr Thr Asn lie Leu Gly His Asp Val Glu lie His Gly Asp 
130 135 140 

Glu Leu Val Arg Met Ser Pro Ser Met Glu Tyr Arg Trp Tyr His Arg 
145 150 155 160 

Tyr Gin His Trp Phe He Trp Phe Val Tyr Pro Phe He Pro Tyr Tyr 
165 170 175 

Trp Ser He Ala Asp Val Gin Thr Met Leu Phe Lys Arg Gin Tyr His 
180 185 190 

Asp His Glu He Pro Ser Pro Thr Trp Val Asp He Ala Thr Leu Leu 
195 200 205 

Ala Phe Lys Ala Phe Gly Val Ala Val Phe Leu He He Pro He Ala 
210 215 220 

Val Gly Tyr Ser Pro Leu Glu Ala Val He Gly Ala Ser He Val Tyr 
225 230 235 240 

Met Thr His Gly Leu Val Ala Cys Val Val Phe Met Leu Ala His Val 
245 250 255 

He Glu Pro Ala Glu Phe Leu Asp Pro Asp Asn Leu His lie Asp Asp 
260 265 270 

Glu Trp Ala He Ala Gin Val Lys Thr Thr Val Asp Phe Ala Pro Asn 
275 280 285 

Asn Thr He He Asn Trp Tyr Val Gly Gly Leu Asn Tyr Gin Thr Val 
290 295 300 

His His Leu Phe Pro His lie Cys His lie His Tyr Pro Lys lie Ala 
305 310 315 - 320 

Pro lie Leu Ala Glu Val Cys Glu Glu Phe Gly Val Asn Tyr Ala Val 
325 330 335 

His Gin Thr Phe Phe Gly Ala Leu Ala Ala Asn Tyr Ser Trp Leu Lys 
340 345 350 

Lys Met Ser He Asn Pro Glu Thr Lys Ala lie Glu Gin 
355 360 365 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
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<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

5 CCAAGCTTCT GCAGGAGCTC TTTTTTTTTT ttxtt 35 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

15 (ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Synthetic oligonucleotide" 

(ix) FEATURE: 
20 (A) NAME/KEY: misc_f eature 

(B) LOCATION: 21 

(D) OTHER INFORMATION: /number= 1 
/note= "N=Inosine or Cytosine" 

25 (ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION: 27 

(D) OTHER INFORMATION: /number^ 2 
/note= "N=Inosine or Cytosine" 



30 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 
CUACUACUAC UACAYCAYAC NTAYACNAAY AT 32 
(2) INFORMATION FOR SEQ ID NO: 21: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 27 base pairs 
40 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
45 .(A) DESCRIPTION: /desc « "Synthetic oligonucleotide* 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 
50 (B) LOCATION: 13 

(D) OTHER INFORMATION: /number= 1 
/note= "N=Inosine or Cytosine" 

(ix) FEATURE: 
55 (A) NAME/KEY: misc_f eature 

(B) LOCATION: 19 

(D) OTHER INFORMATION: /number= 2 
/note= "N=Inosine or Cytosine" 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21: 
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CAUCAUCAUC AUNGGRAANA RRTGRTG 27 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
CUACUACUAC UAGGAGTCCT CTACGGTGTT TTG 33 
(2) INFORMATION FOR SEQ ID NO: 23: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 
CAUCAUCAUC AUATGATGCT CAAGCTGAAA CTG 33 
_(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Gin Xaa Xaa His His 
1 5 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: other nucleic acid 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 
CUACUACUAC UACTCGAGCA AGATGGGAAC GGACCAAGG 39 
10 (2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 
15 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



20 



55 



60 



(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

25 CAUCAUCAUC AUCTCGAGCT ACTCTTCCTT GGGACGGAG 39 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 47 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

35 (ii) MOLECULE TYPE: other nucleic acid 



40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

CUACUACUAC UATCTAGACT CGAGACCATG GCTGCTGCTC CAGTGTG 47 
(2) INFORMATION FOR SEQ ID NO: 28: 

45 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
50 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 
CAUCAUCAUC AUAGGCCTCG AGTTACTGCG CCTTACCCAT 40 
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(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 

CUACUACUA CUAGGATCCA TGGCACCTCC CAACACT 37 

(2) INFORMATION FOR SEQ ID NO: 30: 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
~{D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: other nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

30 CAUCAUCAU CAUGGTACCT CGAGTTACTT CTTGAAAAAG AC 42 

(2) INFORMATION FOR SEQ ID NO: 31: 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1219 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid (Edited Contig 2 692004) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 



GCACGCCGAC 


CGGCGCCGGG 


AGATCCTGGC 


AAAGTATCCA 


GAGATAAAGT 


CCTTGATGAA 


60 


ACCTGATCCC 


AATTTGATAT 


GGATTATAAT 


TATGATGGTT 


CTCACCCAGT 


TGGGTGCATT 


120 


TTACATAGTA 


AAAGACTTGG 


ACTGGAAATG 


GGTCATATTT 


GGGGCCTATG 


CGTTTGGCAG 


180 


TTGCATTAAC 


CACTCAATGA 


CTCTGGCTAT 


TCATGAGATT 


GCCCACAATG 


CTGCCTTTGG 


240 


CAACTGCAAA 


GCAATGTGGA 


ATCGCTGGTT 


TGGAATGTTT 


GCTAATCTTC 


CTATTGGGAT 


300 


TCCATATTCA 


ATTTCCTTTA 


AGAGGTATCA 


CATGGATCAT 


CATCGGTACC 


TTGGAGCTGA 


360 


TGGCGTCGAT 


GTAGATATTC 


CTACCGATTT 


TGAGGGCTGG 


TTCTTCTGTA 


CCGCTTTCAG 


420 


AAAGTTTATA 


TGGGTTATTC 


TTCAGCCTCT 


CTTTTATGCC 


TTTCGACCTC 


TGTTCATCAA 


480 
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CCCCAAACCA 


ATTACGTATC 


TGGAAGTTAT 


CAATACCGTG 


GCACAGGTCA 


CTTTTGACAT 


540 


5 


TTTAATTTAT 


TACTTTTTGG 


GAATTAAATC 


CTTAGTCTAC 


ATGTTGGCAG 


CATCTTTACT 


600 




TGGCCTGGGT 


TTGCACCCAA 


TTTCTGGACA 


TTTTATAGCT 


GAGCATTACA 


TGTTCTTAAA 


660 




GGGTCATGAA 


ACTTACTCAT 


ATTATGGGCC 


TCTGAATTTA 


CTTACCTTCA 


ATGTGGGTTA 


720 


10 


TCATAATGAA 


CATCATGATT 


TCCCCAACAT 


TCCTGGAAAA 


AGTCTTCCAC 


TGGTGAGGAA 


780 




AATAGCAGCT 


GAATACTATG 


ACAACCTCCC 


TCACTACAAT 


TCCTGGATAA 


AAGTACTGTA 


840 


15 


TGATTTTGTG 


ATGGATGATA 


CAATAAGTCC 


CTACTCAAGA 


ATGAAGAGGC 


ACCAAAAAGG 


900 


AGAGATGGTG 


CTGGAGTAAA 


TATCATTAGT 


GCCAAAGGGA 


TTCTTCTCCA 


AAACTTTAGA 


960 




TGATAAAATG 


GAATTTTTGC 


ATTATTAAAC 


TTGAGACCAG 


TGATGCTCAG 


AAGCTCCCCT 


1020 


20 


GGCACAATTT 


CAGAGTAAGA 


GCTCGGTGAT 


ACCAAGAAGT 


GAATCTGGCT 


TTTAAACAGT 


1080 




CAGCCTGACT 


CTGTACTGCT 


CAGTTTCACT 


CACAGGAAAC 


TTGTGACTTG 


TGTATTATCG 


1140 


25 


TCATTGAGGA 


TGTTTCACTC 


ATGTCTGTCA 


TTTTATAAGC 


ATATCATTTA 


AAAAGCTTCT 


1200 


AAAAAGCTAT 


TTCGCCAGG 










1219 


30 


(2) INFORMATION FOR SEQ ID NO: 32 










35 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 655 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 










(ii) MOLECULE TYPE: other nucleic acid 


(Edited Contig 2153526) 




40 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32 










TTACCTTCTA 


CGTCCGCTTC 


TTCCTCACTT 


ATGTGCCACT 


ATTGGGGCTG 


AAAGCTTCCT 


60 


GGGCCTTTTC 


TTCATAGTCA 


GGTTCCTGGA 


AAGCAACTGG 


TTTGTGTGGG 


TGACACAGAT 


120 




GAACCATATT 


CCCATGCACA 


TTGATCATGA 


CCGGAACATG 


GACTGGGTTT 


CCACCCAGCT 


180 


50 


CCAGGCCACA 


TGCAATGTCC 


ACAAGTCTGC 


CTTCAATGAC 


TGGTTCAGTG 


GACACCTCAA 


240 




CTTCCAGATT 


GAGCACCATC 


TTTTTCCCAC 


GATGCCTCGA 


CACAATTACC 


ACAAAGTGGC 


JUU 


55 


TCCCCTGGTG 


CAGTCCTTGT 


GTGCCAAGCA 


TGGCATAGAG 


TACCAGTCCA 


AGCCCCTGCT 


360 




GTCAGCCTTC 


GCCGACATCA 


TCCACTCACT 


AAAGGAGTCA 


GGGCAGCTCT 


GGCTAGATGC 


420 




CTATCTTCAC 


CAATAACAAC 


AGCCACCCTG 


CCCAGTCTGG 


AAGAAGAGGA 


GGAAGACTCT 


480 


60 


GGAGCCAAGG 


CAGAGGGGAG 


CTTGAGGGAC 


AATGCCACTA 


TAGTTTAATA 


CTCAGAGGGG 


540 




GTTGGGTTTG 


GGGACATAAA 


GCCTCTGACT 


CAAACTCCTC 


CCTTTTATCT 


TCTAGCCACA 


600 
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GTTCTAAGAC CCAAAGTGGG GGGTGGACAC AGAAGTCCCT AGGAGGGAAG GAGCT 655 

5 (2) INFORMATION FOR SEQ ID NO:33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 304 base pairs 

(B) TYPE: nucleic acid 
10 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid (Edited Contig 3506132) 
15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

GTCTTTTACT TTGGCAATGG CTGGATTCCT ACCCTCATCA CGGCCTTTGT CCTTGCTACC 60 

TCTCAGGCCC AAGCTGGATG GCTGCAACAT GATTATGGCC ACCTGTCTGT CTACAGAAAA 120 

20 

CCCAAGTGGA ACCACCTTGT CCACAAATTC GTCATTGGCC ACTTAAAGGG TGCCTCTGCC 180 

AACTGGTGGA ATCATCGCCA CTTCCAGCAC CACGCCAAGC CTAACATCTT CCACAAGGAT 240 

25 CCCGATGTGA ACATGCTGCA CGTGTTTGTT CTGGGCGAAT GGCAGCCCAT CGAGTACGGC 300 

AAGA 304 

30 (2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 918 base pairs 

(B) TYPE: nucleic acid 
35 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

{ii) MOLECULE TYPE: other nucleic acid (Edited Contig 3854933) 
40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

CAGGGACCTA CCCCGCGCTA CTTCACCTGG GACGAGGTGG CCCAGCGCTC AGGGTGCGAG 60 

GAGCGGTGGC TAGTGATCGA CCGTAAGGTG TACAACATCA GCGAGTTCAC CCGCCGGCAT 120 

45 

CCAGGGGGCT CCCGGGTCAT CAGCCACTAC GCCGGGCAGG ATGCCACGGA TCCCTTTGTG 180 

GCCTTCCACA TCAACAAGGG CCTTGTGAAG AAGTATATGA ACTCTCTCCT GATTGGAGAA 24 0 

50 CTGTCTCCAG AGCAGCCCAG CTTTGAGCCC ACCAAGAATA AAGAGCTGAC AGATGAGTTC 300 

CGGGAGCTGC GGGCCACAGT GGAGCGGATG GGGCTCATGA AGGCCAACCA TGTCTTCTTC 3 60 

^ CTGCTGTACC TGCTGCACAT CTTGCTGCTG GATGGTGCAG CCTGGCTCAC CCTTTGGGTC 420 

TTTGGGACGT CCTTTTTGCC CTTCCTCCTC TGTGCGGTGC TGCTCAGTGC AGTTCAGGCC 4 80 

CAGGCTGGCT GGCTGCAGCA TGACTTTGGG CACCTGTCGG TCTTCAGCAC CTCAAAGTGG 54 0 

60 AACCATCTGC TACATCATTT TGTGATTGGC CACCTGAAGG GGGCCCCCGC CAGTTGGTGG 600 

AACCACATGC ACTTCCAGCA CCATGCCAAG CCCAACTGCT TCCGCAAAGA CCCAGACATC 660 
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AACATGCATC CCTTCTTCTT TGCCTTGGGG AAGATCCTCT CTGTGGAGCT TGGGAAACAG 720 

AAGAAAAAAT ATATGCCGTA CAACCACCAG CACARATACT TCTTCCTAAT TGGGCCCCCA 7 80 

GCCTTGCTGC CTCTCTACTT CCAGTGGTAT ATTTTCTATT TTGTTATCCA GCGAAAGAAG 84 0 

TGGGTGGACT TGGCCTGGAT CAGCAAACAG GAATACGATG AAGCCGGGCT TCCATTGTCC 900 

10 ACCGCAAATG CTTCTAAA 918 

(2) INFORMATION FOR SEQ ID NO: 35: 

15 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 168 6 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

20 

(ii) MOLECULE TYPE: other nucleic acid (Edited Contig 2511785) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

25 





GCCACTTAAA 


GGGTGCCTCT 


GCCAACTGGT 


GGAATCATCG 


CCACTTCCAG 


CACCACGCCA 


60 




AGCCTAACAT 


CTTCCACAAG 


GATCCCGATG 


TGAACATGCT 


GCACGTGTTT 






30 


AATGGCAGCC 


CATCGAGTAC 


GGCAAGAAGA 


AGCTGAAATA 


CCTGCCCTAC 


AATCACCAGC 


180 




ACGAATACTT 


CTTCCTGATT 


GGGCCGCCGC 


TGCTCATCCC 


CATGTATTTC 


CAGTACCAGA 


240 


35 


TCATCATGAC 


CATGATCGTC 


CATAAGAACT 


GGGTGGACCT 


GGCCTGGGCC 


GTCAGCTACT 


300 


ACATCCGGTT 


CTTCATCACC 


TACATCCCTT 


TCTACGGCAT 


CCTGGGAGCC 


CTCCTTTTCC 


360 




TCAACTTCAT 


CAGGTTCCTG 


GAGAGCCACT 


GGTTTGTGTG 


GGTCACACAG 


ATGAATCACA 


420 


40 


TCGTCATGGA 


GATTGACCAG 


GAGGCCTACC 


GTGACTGGTT 


CAGTAGCCAG 


CTGACAGCCA 


480 




CCTGCAACGT 


GGAGCAGTCC 


TTCTTCAACG 


ACTGGTTCAG 


TGGACACCTT 


AACTTCCAGA 


540 


45 


TTGAGCACCA 


CCTCTTCCCC 


ACCATGCCCC 


GGCACAACTT 


ACACAAGATC 


GCCCCGCTGG 


600 




TGAAGTCTCT 


ATGTGCCAAG 


CATGGCATTG 


AATACCAGGA 


GAAGCCGCTA 


CTGAGGGCCC 


660 




TGCTGGACAT 


CATCAGGTCC 


CTGAAGAAGT 


CTGGGAAGCT 


GTGGCTGGAC 


GCCTACCTTC 


720 


50 


ACAAATGAAG 


CCACAGCCCC 


CGGGACACCG 


TGGGGAAGGG 


GTGCAGGTGG 


GGTGATGGCC 


780 




AGAGGAATGA 


TGGGCTTTTG 


TTCTGAGGGG 


TGTCCGAGAG 


GCTGGTGTAT 


GCACTGCTCA 


840 


55 


CGGACCCCAT 


GTTGGATCTT 


TCTCCCTTTC 


TCCTCTCCTT 


TTTCTCTTCA 


CATCTCCCCC 


900 




ATAGCACCCT 


GCCCTCATGG 


GACCTGCCCT 


CCCTCAGCCG 


TCAGCCATCA 


GCCATGGCCC 


960 




TCCCAGTGCC 


TCCTAGCCCC 


TTCTTCCAAG 


GAGCAGAGAG 


GTGGCCACCG 


GGGGTGGCTC 


1020 


60 


TGTCCTACCT 


CCACTCTCTG 


CCCCTAAAGA 


TGGGAGGAGA 


CCAGCGGTCC 


ATGGGTCTGG 


1080 




CCTGTGAGTC 


TCCCCTTGCA 


GCCTGGTCAC 


TAGGCATCAC 


CCCCGCTTTG 


GTTCTTCAGA 


1140 
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TGCTCTTGGG GTTCATAGGG GCAGGTCCTA GTCGGGCAGG GCCCCTGACC CTCCCGGCCT 1200 

GGCTTCACTC TCCCTGACGG CTGCCATTGG TCCACCCTTT CATAGAGAGG CCTGCTTTGT 1260 

TACAAAGCTC GGGTCTCCCT CCTGCAGCTC GGTTAAGTAC CCGAGGCCTC TCTTAAGATG 1320 

TCCAGGGCCC CAGGCCCGCG GGCACAGCCA GCCCAAACCT TGGGCCCTGG AAGAGTCCTC 1380 

CACCCCATCA CTAGAGTGCT CTGACCCTGG GCTTTCACGG GCCCCATTCC ACCGCCTCCC 14 40 

CAACTTGAGC CTGTGACCTT GGGACCAAAG GGGGAGTCCC TCGTCTCTTG TGACTCAGCA 1500 

GAGGCAGTGG CCACGTTCAG GGAGGGGCCG GCTGGCCTGG AGGCTCAGCC CACCCTCCAG 1560 

CTTTTCCTCA GGGTGTCCTG AGGTCCAAGA TTCTGGAGCA ATCTGACCCT TCTCCAAAGG 1620 

CTCTGTTATC AGCTGGGCAG TGCCAGCCAA TCCCTGGCCA TTTGGCCCCA GGGGACGTGG 1680 

GCCCTG 168 6 

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1843 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: other nucleic acid (Contig 2535) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

GTCTTTTACT TTGGCAATGG CTGGATTCCT ACCCTCATCA CGGCCTTTGT CCTTGCTACC 60 

TCTCAGGCCC AAGCTGGATG GCTGCAACAT GATTATGGCC ACCTGTCTGT CTACAGAAAA 120 

CCCAAGTGGA ACCACCTTGT CCACAAATTC GTCATTGGCC ACTTAAAGGG TGCCTCTGCC 180 

AACTGGTGGA ATCATCGCCA CTTCCAGCAC CACGCCAAGC CTAACATCTT CCACAAGGAT 24 0 

CCCGATGTGA ACATGCTGCA CGTGTTTGTT CTGGGCGAAT GGCAGCCCAT CGAGTACGGC 300 

AAGAAGAAGC TGAAATACCT GCCCTACAAT CACCAGCACG AATACTTCTT CCTGATTGGG 360 

CCGCCGCTGC TCATCCCCAT GTATTTCCAG TACCAGATCA TCATGACCAT GATCGTCCAT 420 

AAGAACTGGG TGGACCTGGC CTGGGCCGTC AGCTACTACA TCCGGTTCTT CATCACCTAC 4 80 

ATCCCTTTCT ACGGCATCCT GGGAGCCCTC CTTTTCCTCA ACTTCATCAG GTTCCTGGAG 54 0 

AGCCACTGGT TTGTGTGGGT CACACAGATG AATCACATCG TCATGGAGAT TGACCAGGAG 600 

GCCTACCGTG ACTGGTTCAG TAGCCAGCTG ACAGCCACCT GCAACGTGGA GCAGTCCTTC 660 

TTCAACGACT GGTTCAGTGG ACACCTTAAC TTCCAGATTG AGCACCACCT CTTCCCCACC 720 

ATGCCCCGGC ACAACTTACA CAAGATCGCC CCGCTGGTGA AGTCTCTATG TGCCAAGCAT 780 

GGCATTGAAT ACCAGGAGAA GCCGCTACTG AGGGCCCTGC TGGACATCAT CAGGTCCCTG 840 
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AAGAAGTCTG GGAAGCTGTG GCTGGACGCC TACCTTCACA AATGAAGCCA CAGCCCCCGG 900 

GACACCGTGG GGAAGGGGTG CAGGTGGGGT GATGGCCAGA GGAATGATGG GCTTTTGTTC 960 

TGAGGGGTGT CCGAGAGGCT GGTGTATGCA CTGCTCACGG ACCCCATGTT GGATCTTTCT 1020 

CCCTTTCTCC TCTCCTTTTT CTCTTCACAT CTCCCCCATA GCACCCTGCC CTCATGGGAC 1080 

CTGCCCTCCC TCAGCCGTCA GCCATCAGCC ATGGCCCTCC CAGTGCCTCC TAGCCCCTTC 114 0 

TTCCAAGGAG CAGAGAGGTG GCCACCGGGG GTGGCTCTGT CCTACCTCCA CTCTCTGCCC 1200 

CTAAAGATGG GAGGAGACCA GCGGTCCATG GGTCTGGCCT GTGAGTCTCC CCTTGCAGCC 12 60 

TGGTCACTAG GCATCACCCC CGCTTTGGTT CTTCAGATGC TCTTGGGGTT CATAGGGGCA 1320 

GGTCCTAGTC GGGCAGGGCC CCTGACCCTC CCGGCCTGGC TTCACTCTCC CTGACGGCTG 1380 

CCATTGGTCC ACCCTTTCAT AGAGAGGCCT GCTTTGTTAC AAAGCTCGGG TCTCCCTCCT 14 40 

GCAGCTCGGT TAAGTACCCG AGGCCTCTCT TAAGATGTCC AGGGCCCCAG GCCCGCGGGC 1500 

ACAGCCAGCC CAAACCTTGG GCCCTGGAAG AGTCCTCCAC CCCATCACTA GAGTGCTCTG 1560 

ACCCTGGGCT TTCACGGGCC CCATTCCACC GCCTCCCCAA CTTGAGCCTG TGACCTTGGG 1620 

ACCAAAGGGG GAGTCCCTCG TCTCTTGTGA CTCAGCAGAG GCAGTGGCCA CGTTCAGGGA 1680 

GGGGCCGGCT GGCCTGGAGG CTCAGCCCAC CCTCCAGCTT TTCCTCAGGG TGTCCTGAGG 1740 

TCCAAGATTC TGGAGCAATC TGACCCTTCT CCAAAGGCTC TGTTATCAGC TGGGCAGTGC 1800 

CAGCCAATCC CTGGCCATTT GGCCCCAGGG GACGTGGGCC CTG 1843 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2257 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid (Edited Contig 253538a) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

CAGGGACCTA CCCCGCGCTA CTTCACCTGG GACGAGGTGG CCCAGCGCTC AGGGTGCGAG 60 

GAGCGGTGGC TAGTGATCGA CCGTAAGGTG TACAACATCA GCGAGTTCAC CCGCCGGCAT 120 

CCAGGGGGCT CCCGGGTCAT CAGCCACTAC GCCGGGCAGG ATGCCACGGA TCCCTTTGTG 180 

GCCTTCCACA TCAACAAGGG CCTTGTGAAG AAGTATATGA ACTCTCTCCT GATTGGAGAA 2 40 

CTGTCTCCAG AGCAGCCCAG CTTTGAGCCC ACCAAGAATA AAGAGCTGAC AGATGAGTTC 300 

CGGGAGCTGC GGGCCACAGT GGAGCGGATG GGGCTCATGA AGGCCAACCA TGTCTTCTTC 360 

CTGCTGTACC TGCTGCACAT CTTGCTGCTG GATGGTGCAG CCTGGCTCAC CCTTTGGGTC 420 
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TTTGGGACGT CCTTTTTGCC CTTCCTCCTC TGTGCGGTGC TGCTCAGTGC AGTTCAGCAG 4 80 
GCCCAAGCTG GATGGCTGCA ACATGATTAT GGCCACCTGT CTGTCTACAG AAAACCCAAG 54 0 
TGGAACCACC TTGTCCACAA ATTCGTCATT GGCCACTTAA AGGGTGCCTC TGCCAACTGG 600 
TGGAATCATC GCCACTTCCA GCACCACGCC AAGCCTAACA TCTTCCACAA GGATCCCGAT 660 
GTGAACATGC TGCACGTGTT TGTTCTGGGC GAATGGCAGC CCATCGAGTA CGGCAAGAAG 720 
AAGCTGAAAT ACCTGCCCTA CAATCACCAG CACGAATACT TCTTCCTGAT TGGGCCGCCG 78 0 
CTGCTCATCC CCATGTATTT CCAGTACCAG ATCATCATGA CCATGATCGT CCATAAGAAC 84 0 
15 TGGGTGGACC TGGCCTGGGC CGTCAGCTAC TACATCCGGT TCTTCATCAC CTACATCCCT 900 

TTCTACGGCA TCCTGGGAGC CCTCCTTTTC CTCAACTTCA TCAGGTTCCT GGAGAGCCAC 960 

TGGTTTGTGT GGGTCACACA GATGAATCAC ATCGTCATGG AGATTGACCA GGAGGCCTAC 1020 

20 

CGTGACTGGT TCAGTAGCCA GCTGACAGCC ACCTGCAACG TGGAGCAGTC CTTCTTCAAC 1080 

GACTGGTTCA GTGGACACCT TAACTTCCAG ATTGAGCACC ACCTCTTCCC CACCATGCCC 114 0 

25 CGGCACAACT TACACAAGAT CGCCCCGCTG GTGAAGTCTC TATGTGCCAA GCATGGCATT 1200 

GAATACCAGG AGAAGCCGCT ACTGAGGGCC CTGCTGGACA TCATCAGGTC CCTGAAGAAG 12 60 

TCTGGGAAGC TGTGGCTGGA CGCCTACCTT CACAAATGAA GCCACAGCCC CCGGGACACC 1320 

30 

GTGGGGAAGG GGTGCAGGTG GGGTGATGGC CAGAGGAATG ATGGGCTTTT GTTCTGAGGG 1380 

GTGTCCGAGA GGCTGGTGTA TGCACTGCTC ACGGACCCCA TGTTGGATCT TTCTCCCTTT 14 40 

35 CTCCTCTCCT TTTTCTCTTC ACATCTCCCC CATAGCACCC TGCCGTCATG GGACCTGCCC 1500 

TCCCTCAGCC GTCAGCCATC AGCCATGGCC CTCCCAGTGC CTCCTAGCCC CTTCTTCCAA ' 1560 

GGAGCAGAGA GGTGGCCACC GGGGGTGGCT CTGTCCTACC TCCACTCTCT GCCCCTAAAG 1620 

40 

ATGGGAGGAG ACCAGCGGTC CATGGGTCTG GCCTGTGAGT CTCCCCTTGC AGCCTGGTCA 168 0 

CTAGGCATCA CCCCCGCTTT GGTTCTTCAG ATGCTCTTGG GGTTCATAGG GGCAGGTCCT 17 4 0 

45 AGTCGGGCAG GGCCCCTGAC CCTCCCGGCC TGGCTTCACT CTCCCTGACG GCTGCCATTG 1800 

GTCCACCCTT TCATAGAGAG GCCTGCTTTG TTACAAAGCT CGGGTCTCCC TCCTGCAGCT 18 60 

^ CGGTTAAGTA CCCGAGGCCT CTCTTAAGAT GTCCAGGGCC CCAGGCCCGC GGGCACAGCC 1920 

AGCCCAAACC TTGGGCCCTG GAAGAGTCCT CCACCCCATC ACTAGAGTGC TCTGACCCTG 1980 

GGCTTTCACG GGCCCCATTC CACCGCCTCC CCAACTTGAG CCTGTGACCT TGGGACCAAA 204 0 

55 GGGGGAGTCC CTCGTCTCTT GTGACTCAGC AGAGGCAGTG GCCACGTTCA GGGAGGGGCC 2100 

GGCTGGCCTG GAGGCTCAGC CCACCCTCCA GCTTTTCCTC AGGGTGTCCT GAGGTCCAAG 2160 

ATTCTGGAGC AATCTGACCC TTCTCCAAAG GCTCTGTTAT CAGCTGGGCA GTGCCAGCCA 2220 

ATCCCTGGCC ATTTGGCCCC AGGGGACGTG GGCCCTG 2257 



60 
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(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 411 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

10 (ii) MOLECULE TYPE: amino acid (Translation of Contig 2692004) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 



15 


His 


Ala 


Asp 


Arg 


Arg Arg 


Glu 


He 


Leu 


Ala 


Lys 


Tyr 


Pro 


Glu 


He 




1 








3 










10 










15 




Lys 


Ser 


Leu 


Met 


Lys 

£. u 


Pro 


Asp 


Pro 


Asn 


Leu 
25 


He 


Trp 


He 


He 


He 
30 


20 


Met 


Met 


Val 


Leu 


inr 


Gin 


Leu 


Gly Ala 


Phe 


Tyr 


He 


Val 


Lys 


Asp 










jj 










40 










45 




Leu 


Asp 


Trp 


Lys 


Trp 
ou 


Val 


He 


true; 


C*\ \r 
uiy 


Ala 
55 


Tyr 


Ala 


Phe 


Glv 


Ser 
60 




Cys 


He 


Asn 


His 


Ser 


Met 


Thr 


Leu 


Ala 


He 


His 


Glu 


He 


Ala 


His 


25 




















70 










75 


Asn 


Ala 


Ala 


Phe 


r* i ■« » 

ft n 
o u 


Asn 


Cys 


Lys 


Ala 


Met 
85 


Trp 


Asn 


Ara 


TrD 


Phe 
90 




Gly Met 


Phe 


Ala 


Asn 


Leu 


Pro 


He 


Gly 


He 


Pro 


Tyr 


Ser 


He 


Ser 












QC 










100 










105 


30 


Phe 


Lys 


Arg 


Tyr 


nib 


Met Asp 


His 


His 


Aro 


Tyr 


Leu 


Glv 


Ala 


Asp 










inn 
1 1 u 










115 










120 




Gly Val 


Asp 


Val 


Asp 


He 


Pro 


Thr 


Asp 


Phe 


Glu Gly 


Trr> 


Phe 


Phe 












1Z o 










130 










135 




Cys 


Thr 


Ala 


Phe 


Arg 


Lys 


Phe 


He 


Trp 


Val 


He 


Leu 


Gin 


Pro 


Leu 


35 










J. H u 










145 










150 


Phe 


Tyr 


Ala 


Phe 


Arg 
loo 


Pro 


Leu 


Phe 


He 


Asn 
160 


Pro 


Lys 


Pro 


He 


Thr 
165 




Tyr 


Leu 


Glu 


Val 


He 

J. / u 


Asn 


Thr 


Val 


Ala 


Gin 
175 


Val 


Thr 


Phe 


Asp 


He 
180 


40 


Leu 


He 


Tyr 


Tyr 


Phe 


Leu 


Gly 


He 


Lys 


Ser 


Leu 


Val 


Tvr 


Met 


Leu 










185 










190 










195 




Ala 


Ala 


Ser 


Leu 


Leu 
200 


Gly 


Leu 


Gly 


Leu 


His 
205 


Pro 


He 


Ser 


Gly 


His 
210 




Phe 


He 


Ala 


Glu 


His 


Tyr 


Met 


Phe 


Leu 


Lys 


Gly 


His 


Glu 


Thr 


Tyr 


45 










215 










220 










225 


Ser 


Tyr 


Tyr 


Gly 


Pro 


Leu 


Asn 


Leu 


Leu 


Thr 


Phe 


Asn 


Val 


Gly 


Tyr 












230 










235 








240 




His 


Asn 


Glu 


His 


His 
245 


Asp 


Phe 


Pro 


Asn 


He 
250 


Pro 


Gly 


Lys 


Ser 


Leu 
255 


50 


Pro 


Leu 


Val 


Arg 


Lys 


He 


Ala 


Ala 


Glu 


Tyr 


Tyr 


Asp 


Asn 


Leu 


Pro 










260 










265 










270 




His 


Tyr 


Asn 


Ser 


Trp 
275 


He 


Lys 


Val 


Leu 


Tyr 
280 


Asp 


Phe 


Val 


Met 


Asp 
285 




Asp 


Thr 


He 


Ser 


Pro 


Tyr 


Ser 


Arg Met 


Lys 


Arg 


His 


Gin 


Lys 


Gly 


55 










290 










295 










300 


Glu 


Met 


Val 


Leu 


Glu 


★ ★ ★ 


He 


Ser 


Leu 


Val 


Pro 


Lys 


Gly 


Phe 


Phe 












305 










310 








315 




Ser 


Lys 


Thr 


Leu 


Asp 
320 


Asp 


Lys 


Met 


Glu 


Phe 
325 


Leu 


His 


Tyr 


* * * 


Thr 
330 


60 


★ ★ + 


Asp 


Gin 


* ★ * 


Cys 


Ser 


Glu 


Ala 


Pro 


Leu 


Ala 


Gin 


Phe 


Gin 


Ser 










335 










340 










345 




Lys 


Ser 


Ser 


val 


He 
350 


Pro 


Arg 


Ser 


Glu 


Ser 
355 


Gly 


Phe 


+ + * 


Thr 


Val 
360 
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10 



20 



25 



30 



35 



40 



45 



50 



Ser Leu Thr Leu Tyr Cys_Ser Val Ser Leu Thr Gly Asn Leu *** 

365 370 375 

Leu Val Tyr Tyr Arg His *** Gly Cys Phe Thr His Val Cys His 

380 385 "* 390 

Phe lie Ser lie Ser Phe Lys Lys Leu Leu Lys Ser Tyr Phe Ala 

400 405 410 

Arg 

(2) INFORMATION FOR SEQ ID NO: 39: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 218 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
15 (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: amino acid (Translation of Contig 2153526) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 



Tyr 


Leu 


Leu 


Arg 


Pro 


Leu 


Leu 


Pro 


His 


Leu 


Cys 


Ala 


Thr 


He 


Gly 


1 








5 










10 










15 


Ala 


Glu 


Ser 


Phe 


Leu 


Gly 


Leu 


Phe 


Phe 


He 


Val 


Arg 


Phe 


Leu 


Glu 










20 










25 








30 


Ser 


Asn 


Trp 


Phe 


Val 
35 


Trp 


Val 


Thr 


Gin 


Met 
40 


Asn 


His 


He 


Pro 


Met 
45 


His 


He 


Asp 


His 


Asp 
50 


Arg 


Asn 


Met 


Asp 


Trp 
55 


Val 


Ser 


Thr 


Gin 


Leu 
60 


Gin 


Ala 


Thr 


Cys 


Asn 


Val 


His 


Lys 


Ser 


Ala 


Phe 


Asn 


Asp 


Trp 


Phe 










65 










70 








75 


Ser 


Gly 


His 


Leu 


Asn 
80 


Phe 


Gin 


He 


Glu 


His 
85 


His 


Leu 


Phe 


Pro 


Thr 
90 


Met 


Pro 


Arg 


His 


Asn 
95 


Tyr 


His 


Lys 


Val 


Ala 
100 


Pro 


Leu 


Val 


Gin 


Ser 
105 


Leu 


Cys 


Ala 


Lys 


His 


Gly 


He 


Glu 


Tyr 


Gin 


Ser 


Lys 


Pro 


Leu 


Leu 










110 










115 








120 


Ser 


Ala 


Phe 


Ala 


Asp 


He 


He 


His 


Ser 


Leu 


Lys 


Glu 


Ser 


Gly 


Gin 










125 










130 








135 


Leu 


Trp 


Leu 


Asp 


Ala 
140 


Tyr 


Leu 


His 


Gin 


* ★ * 
145 


Gin 


Gin 


Pro 


Pro 


Cys 
150 


Pro 


Val 


Trp 


Lys 


Lys 


Arg 


Arg 


Lys 


Thr 


Leu 


Glu 


Pro 


Arg 


Gin 


Arg 










155 










160 








165 


Gly 


Ala 


* ★ * 


Gly 


Thr 


Met 


Pro 


Leu 


* * * 


Phe 


Asn 


Thr 


Gin 


Arg 


Gly 










170 










175 








180 


Leu 


Gly 


Leu 


Gly 


Thr 
185 


** * 


Ser 


Leu 


* * ★ 


Leu 
190 


Lys 


Leu 


Leu 


Pro 


Phe 
195 


He 


Phe 


+ * + 


Pro 


Gin 


Phe 


★ * ★ 


Asp 


Pro 


Lys 


Trp 


Gly Val 


Asp 


Thr 










200 










205 








210 


Glu 


Val 


Pro 


Arg 


Arg 
215 


Glu 


Gly 


Ala 

















55 (2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 71 amino acids 

(B) TYPE: amino acid 

60 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: amino acid (Translation of Contig 3506132) 
<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

5 



vox 


Phe Tyr 


Phe 


uiy 


Asn 


Gly 


Trp 


He 


Pro 


Tnr 


Leu 


He 


Thr 


Ala 


1 






5 










10 










15 


Phe 


Val Leu 


Ala 


Thr 


Ser 


Gin 


Ala 


Gin 


Ala 


Gly 


Trp 


Leu 


Gin 


His 








20 










25 










30 


Asp 


Tyr Gly 


His 


Leu 


Ser 


Val 


Tyr 


Arg 


Lys 


Pro 


Lys 


Trp Asn 


His 








35 










40 










45 


Leu 


Val His 


Lys 


Phe 


Val 


lie 


Gly 


His 


Leu 


Lys 


Gly Ala 


Ser 


Ala 








50 










55 










60 


Asn 


Trp Trp 


Asn 


His 


Arg 


His 


Phe 


Gin 


His 


His 


Ala 


Lys 


Pro 


Asn 








65 










70 










75 


Leu 


Gly Glu 


Trp 


Gin 


Pro 


He 


Glu 


Tyr 


Gly 


Lys 


Xxx 














80 










85 












(2) 


INFORMATION 


FOR 


SEQ 


ID NO:41: 















(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 306 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

30 (ii) MOLECULE TYPE: amino acid (Translation of Contig 3854933) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 



35 


Gin 
1 


Gly 


Pro 


Thr 


Pro 
5 


Arg 


Tyr 


Phe 


Thr 


Trp 
10 


Asp 


Glu 


Val 


Ala 


Gin 
15 




Arg 


Ser 


Gly 


Cys 


Glu 
20 


Glu 


Arg 


Trp 


Leu 


Val 
25 


He 


Asp 


Arg 


Lys 


Val 
30 


40 


Tyr 


Asn 


He 


Ser 


Glu 


Phe 


Thr 


Arg 


Arg 


His 


Pro 


Gly 


Gly 


Ser 


Arg 










35 










40 






45 




Val 


He 


Ser 


His 


Tyr 


Ala 


Gly 


Gin 


Asp 


Ala 


Thr Asp 


Pro 


Phe 


Val 












50 










55 










60 




Ala 


Phe 


His 


He 


Asn 


Lys 


Gly 


Leu 


Val 


Lys 


Lys 


Tyr 


Met 


Asn 


Ser 


45 










65 










70 








75 


Leu 


Leu 


He 


Gly 


Glu 
80 


Leu 


Ser 


Pro 


Glu 


Gin 
85 


Pro 


Ser 


Phe 


Glu 


Pro 
90 




Thr 


Lys 


Asn 


Lys 


Glu 


Leu 


Thr 


Asp 


Glu 


Phe 


Arg 


Glu 


Leu 


Arg 


Ala 












95 










100 








105 


50 


Thr 


Val 


Glu 


Arg 


Met 


Gly 


Leu 


Met 


Lys 


Ala 


Asn 


His 


Val 


Phe 


Phe 










110 










115 










120 




Leu 


Leu 


Tyr 


Leu 


Leu 
125 


His 


He 


Leu 


Leu 


Leu 
130 


Asp 


Gly 


Ala 


Ala 


Trp 
135 




Leu 


Thr 


Leu 


Trp 


Val 


Phe 


Gly 


Thr 


Ser 


Phe 


Leu 


Pro 


Phe 


Leu 


Leu 


55 










140 










145 










150 


Cys 


Ala 


Val 


Leu 


Leu 


Ser 


Ala 


Val 


Gin 


Ala 


Gin 


Ala 


Gly 


Trp 


Leu 












155 










160 






165 




Gin 


His 


Asp 


Phe 


Gly 


His 


Leu 


Ser 


Val 


Phe 


Ser 


Thr 


Ser 


Lys 


Trp 












170 










175 








180 


60 


Asn 


His 


Leu 


Leu 


His 


His 


Phe 


Val 


He 


Gly 


His 


Leu 


Lys 


Gly 


Ala 










185 










190 






195 




Pro 


Ala 


Ser 


Trp 


Trp 


Asn 


His 


Met 


His 


Phe 


Gin 


His 


His 


Ala 


Lys 



200 205 210 
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Pro 


Asn 


Cys 


Pne 


Arg 
215 


Lys 


Asp 


Pro 


Asp 


He 
220 


Asn 


Met 


His 


Pro 


Pne 
225 




Pne 


Pne 


Ala 


Leu 


r* i * > 
Caly 

£- ~> U 


Lys 


lie 


Leu 


Ser 


val 

£. J J 


CalU 


Leu 


uiy 


Lys 


bin 


5 


Lys 


Lys 


Lys 


Tyr 


Met 
245 


Pro 


Tyr 


Asn 


His 


Gin 
250 


His 


Xxx 


Tyr 


Phe 


Phe 
255 




Leu 


He 


Gly 


Pro 


Pro 
260 


Ala 


Leu 


Leu 


Pro 


Leu 
265 


Tyr 


Phe 


Gin 


Trp 


Tyr 
270 


10 


He 


Phe 


Tyr 


Phe 


Val 


He 


Gin 


Arg 


Lys 


Lys 


Trp 


Val 


Asp 


Leu 


Ala 










275 










280 










285 




Trp 


He 


Ser 


Lys 


Gin 
290 


Glu 


Tyr 


Asp 


Glu 


Ala 
295 


Gly 


Leu 


Pro 


Leu 


Ser 
300 




Thr 


Ala 


Asn 


Ala 


Ser 


Lys 





















305 

15 

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 566 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: amino acid (Translation of Contig 2511785) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 



30 



35 



40 



45 



50 



55 



60 



His 


Leu 


Lys 


Gly 


Ala 


Ser 


Ala 


Asn 


Trp 


Trp 


Asn 


His 


Arg 


His 


Phe 


1 








5 










10 










15 


Gin 


His 


His 


Ala 


Lys 


Pro 


Asn 


He 


Phe 


His 


Lys 


Asp 


Pro 


Asp 


Val 










20 










25 










30 


Asn 


Met 


Leu 


His 


Val 


Phe 


Val 


Leu Gly Glu 


Trp 


Gin 


Pro 


He 


Glu 










35 










40 










45 


Tyr 


Gly 


Lys 


Lys 


Lys 


Leu 


Lys 


Tyr 


Leu 


Pro 


Tyr 


Asn 


His 


Gin 


His 










50 










55 










60 


Glu 


Tyr 


Phe 


Phe 


Leu 


He 


Gly 


Pro 


Pro 


Leu 


Leu 


He 


Pro 


Met 


Tyr 










65 










70 










75 


Phe 


Gin 


Tyr 


Gin 


He 


He 


Met 


Thr 


Met 


He 


Val 


His 


Lys 


Asn 


Trp 










80 










85 










90 


Val 


Asp 


Leu 


Ala 


Trp 


Ala 


Val 


Ser 


Tyr 


Tyr 


He 


Arg 


Phe 


Phe 


He 










95 










100 










105 


Thr 


Tyr 


He 


Pro 


Phe 


Tyr 


Gly 


He 


Leu 


Gly 


Ala 


Leu 


Leu 


Phe 


Leu 










110 










115 










120 


Asn 


Phe 


He 


Arg 


Phe 


Leu 


Glu 


Ser 


His 


Trp 


Phe 


Val 


Trp 


Val 


Thr 










125 










130 










135 


Gin 


Met 


Asn 


His 


He 


Val 


Met 


Glu 


He 


Asp 


Gin 


Glu 


Ala 


Tyr 


Arg 










140 










145 










150 


Asp 


Trp 


Phe 


Ser 


Ser 


Gin 


Leu 


Thr 


Ala 


Thr 


Cys 


Asn 


Val 


Glu 


Gin 










155 










160 










165 


Ser 


Phe 


Phe 


Asn 


Asp 


Trp 


Phe 


Ser 


Gly 


His 


Leu 


Asn 


Phe 


Gin 


He 










170 










175 










180 


Glu 


His 


His 


Leu 


Phe 


Pro 


Thr 


Met 


Pro Arg 


His 


Asn 


Leu 


His 


Lys 










185 










190 










195 


He 


Ala 


Pro 


Leu 


Val 


Lys 


Ser 


Leu 


Cys 


Ala 


Lys 


His 


Gly 


He 


Glu 










200 










205 










210 


Tyr 


Gin 


Glu 


Lys 


Pro 


Leu 


Leu 


Arg 


Ala 


Leu 


Leu 


Asp 


He 


He 


Arg 










215 










220 










225 


Ser 


Leu 


Lys 


Lys 


Ser 


Gly 


Lys 


Leu 


Trp 


Leu 


Asp 


Ala 


Tyr 


Leu 


His 










230 










235 










240 


Lys 


** * 


Ser 


His 


Ser 


Pro 


Arg Asp 


Thr 


Val 


Gly 


Lys 


Gly Cys 


Arg 
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55 











245 










250 










255 


irp 


ui y 


Asp Gly 


Pin 


Arg Asn 


Asp 


(j±y 


Leu 


Leu 


Phe 




Gly 


Val 










260 










265 








270 


Ofcr £ 


Pin 


Arg 


Leu 


vdi 
t / 3 


Tyr 


Ala 


Leu 


Leu 


Thr 


Asp 


Pro 


Met 


Leu 


Asp 
285 


Leu 


Se r 


Pro 


Fne 


Leu 
290 


Leu 


Ser 


Phe 


Fne 


Ser 
295 


Ser 


His 


Leu 


Pro 


His 
300 


Ser 


inr 


Leu 


irro 


Ser 


Trp Asp 


Leu 


Pro 


Ser 


Leu 


Ser 


Arg 


Gin 


Pro 




















310 










315 


Ser 




Met 


Ala 


Leu 
320 


Pro 


Val 


Pro 


Pro 


Ser 
325 


Pro 


Phe 


Phe 


Gin 


Gly 
330 


/\ia 




Arg 


Trp 


Pro 
335 


Pro 


Gly 


Val 


Ala 


Leu 
340 


Ser 


Tyr 


Leu 


His 


Ser 
345 


Leu 


Pro 


Leu 


Lys 


Kyi _ X- 

Met 


Gly Gly Asp 


Gin 


Arg 


Ser 


Met 


Gly 


Leu 


Ala 










Jou 










355 










360 


Cys 


C*1U 


Ser 


Pro 


Leu 
365 


Ala 


Ala 


Trp 


Ser 


Leu 
370 


Gly 


He 


Thr 


Pro 


Ala 
375 


Leu 


vai 


Leu 


Cain 


Met 


Leu 


Leu 


Gly 


Phe 


He 


Gly 


Ala Gly 


Pro 


Ser 










380 










385 










390 


Arg 


fiJ.a 


b±y 


Pro 


Leu 
400 


Thr 


Leu 


Pro 


Ala 


Trp 
405 


Leu 


His 


Ser 


Pro 


* * * 
410 


Arg 


Leu 


Pro 


Leu 


vai 
415 


His 


Pro 


Phe 


He 


Glu 
420 


Arg 


Pro 


Ala 


Leu 


Leu 
425 




C y 


Ser 


oiy 


Leu 


Pro 


Pro 


Ala 


Ala 


Arg 


Leu 


Ser 


Thr 


Arg 


Gly 










1 J u 


















440 


Leu 




+ * * 


Asp 


Val 
i *i i> 


Gin 


Gly 


Pro 


Arg 


Pro 

a en 


Ala 


Gly 


Thr 


Ala 


Ser 
455 


tr l, 


Asn 


Leu 


Gly 


Pro 
4 An 


Trp 


Lys 


Ser 


Pro 


Pro 

A C C 

4 65 


Pro 


His 


His 




Ser 
470 


Al ^ 

nid 


Leu 


Thr 


Leu 


uly 
4 "7 ^ 


Phe 


His 


Gly 


Pro 


His 
480 


Ser 


Thr 


Ala 


Ser 


Pro 
485 


1 HIT 


* * * 


Ala 


Cys 


Asp 
490 


Leu 


Gly 


Thr 


Lys 


Gly 
495 


Gly 


Val 


Pro 


Arg 


Leu 
500 


Leu 


-A- + 


Leu 


Ser 


Arg 


Gly 


Ser 


Gly His 


Val 


Gin 


Gly Gly 


Ala 


Gly 










505 










510 










515 


Trp 


Pro 


Gly Gly 


Ser 


Ala 


His 


Pro 


Pro 


Ala 


Phe 


Pro 


Gin 


Gly 


Val 










520 










525 








530 


Leu 


Arg 


Ser 


Lys 


He 


Leu 


Glu 


Gin 


Ser 


Asp 


Pro 


Ser 


Pro 


Lys 


Ala 










535 










540 








545 


Leu 


Leu 


Ser 


Ala 


Gly 


Gin 


Cys 


Gin 


Pro 


He 


Pro 


Gly 


His 


Leu 


Ala 










550 










555 








560 


Pro 


Gly 


Asp 


Val 


Gly 
565 


Pro 


Xxx 



















(2) INFORMATION FOR SEQ ID NO: 43: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 619 amino acids 
50 (B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: amino acid (Translation of Contig 2535) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 



60 Val Phe Tyr Phe Gly Asn Gly Trp He Pro Thr Leu He Thr Ala 

1 5 10 15 

Phe Val Leu Ala Thr Ser Gin Ala Gin Ala Gly Trp Leu Gin His 
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60 











2 0 










2o 












Asp 


Tyr 


Gly 


HIS 


Leu 


Ser 


val 


Tyr Arg 


Lys 


Pro 


Lys 


Trp 


Asn 


rllS 










"a q 










A <\ 

4 0 










4 o 


Leu 


val 


His 


Lys 


Pne 


val 


lie 


Gly 


His 


Leu 


Lys 


Gly 


Ala 


Ser 


Ala 










DO 










55 










bO 


Asn 


Trp 


Trp 


Asn 


HIS 


Arg 


HIS 


Pne 


Gin 


HIS 


His 


■ft 1 _ 

Ala 


Lys 


Pro 


TV r* ti. 

Asn 










DO 










I 0 










1 D 


XT*-. 

lie 


Pne 


Hxs 


Lys 


Asp 


Pro 


Asp 


Val 


Asn 


Met 


Leu 


His 


Val 


Pne 


Val 










o ft 










DO 










yo 


Leu 


Gly 


Glu 


Trp 


Gin 


Pro 


He 


Glu 


Tyr 


Gly 


Lys 


Lys 


Lys 


Leu 


Lys 










95 










100 










105 


Tyr 


Leu 


Pro 


Tyr 


Asn 


His 


Gin 


His 


Glu 


Tyr 


Phe 


Phe 


Leu 


He 


Gly 










110 










115 










120 


Pro 


Pro 


Leu 


Leu 


He 


Pro 


Met 


Tyr 


Phe 


Gin 


Tyr 


Gin 


He 


He 


Met 










toe 

125 










1 *3 ft 

130 










135 


Thr 


Met 


He 


Val 


His 


Lys 


Asn 


Trp 


Val 


Asp 


Leu 


Ala 


Trp 


Ala 


Val 










"1 A ft 

14 0 










145 










T C. ft 

150 


Ser 


Tyr 


Tyr 


He 


Arg 


Phe 


Phe 


He 


Thr 


Tyr 


He 


Pro 


Phe 


Tyr 


Gly 










155 










160 










165 


X 1 — 

He 


Leu 


Gly Ala 


Leu 


Leu 


Phe 


Leu 


Asn 


Phe 


He 


Arg 


Phe 


Leu 


Glu 










1 /O 










175 










1 O ft 

loO 


Ser 


His 


Trp 


irne 


Val 


Trp 


Val 


Thr 


Gin 


Met 


Asn 


His 


I le 


Val 


Met 










lot 










ion 

iy o 










iyo 


GlU 


X 1 ^ 

lie 


Asp 


uin 


GiU 


TV 1 — 

Ala 


Tyr 


Arg 


Asp 


Trp 


Phe 


Ser 


Ser 


/— i _ 
Gin 


Leu 










zuu 










o n q 
2 UD 










210 


Thr 


Ala 


inr 


Cys 


Asn 


Val 


Glu 


Gin 


Ser 


Phe 


Phe 


Asn 


Asp 


Trp 


Phe 




















o o n 
22 U 










220 


Ser 


Gly 


H \ Q 

ni o 


Leu 


Asn 


pne 


Gin 


X 1 a 

lie 


GlU 


His 


His 


Leu 


Phe 


Pro 


Thr 




















235 










*"> A ft 

240 


Met 


Pro 


Arg 


His 


Asn 


Leu 


His 


Lys 


lie 


Ala 


Pro 


Leu 


Val 


Lys 


Ser 










O /l ^ 










250 










o c c 
2oo 


Leu 


Cys 


/\ia 


Lys 


HIS 


Gly 


He 


Glu 


Tyr 


Gin 


Glu 


Lys 


Pro 


Leu 


Leu 










260 










265 










270 


Arg Ala 


Leu 


Leu 


Asp 


lie 


TI- 


Arg 


Ser 


Leu 


Lys 


Lys 


Ser 


Gly 


Lys 










o n c 

275 










280 










285 


Leu 


Trp 


Leu 


Asp 


Ala 


Tyr 


Leu 


His 


Lys 


★ * * 


Ser 


His 


Ser 


Pro 


Arg 










290 




- 






295 










300 


Asp 


Thr 


Val 


Gly 


Lys 


Gly 


Cys 


Arg 


Trp 


Gly Asp 


Gly 


Gin 


Arg 


Asn 




















310 










olo 


Asp 


Gly 


Leu 


Leu 


pne 


* * ■* 


Gly 


Val 


Ser 


Glu 


Arg 


Leu 


Val 


Tyr 


Ala 










■5 9 n 










325 










*3 ^ n 


Leu 


Leu 


Thr 


Asp 


Pro 


Ma4- 


Leu 


Asp 


Leu 


Ser 


Pro 


Pne 


Leu 


Leu 


Ser 




















340 












Phe 


Phe 


Ser 


Ser 


nlS 


Leu 


Pro 


His 


Ser 


Thr 


Leu 


Pro 


Ser 


Trp 


Asp 










r\ 










355 










360 


Leu 


Pro 


Ser 


Leu 


ber 


Arg 


Gin 


Pro 


Ser 


Ala 


Met 


Ala 


Leu 


Pro 


Val 










365 










370 










o / o 


Pro 


Pro 


Ser 


Pro 


Phe 


pne 


Gin 


Gly Ala 


Glu 


Arg 


Trp 


Pro 


Pro 


Gly 










380 










385 










390 


Val 


Ala 


Leu 


Ser 


Tyr 


Leu 


His 


Ser 


Leu 


Pro 


Leu 


Lys 


Met 


Gly 


Gly 










400 










405 










410 


Asp 


Gin 


Arg 


Ser 


Met 


Gly 


Leu 


Ala 


Cys 


Glu 


Ser 


Pro 


Leu 


Ala 


Ala 










415 










420 












Trp 


Ser 


Leu 


Gly 


He 


Thr 


Pro 


Ala 


Leu 


Val 


Leu 


Gin 


Met 


Leu 


Leu 










430 










435 










440 


Gly 


Phe 


He 


Gly Ala 


Gly 


Pro 


Ser 


Arg 


Ala 


Gly 


Pro 


Leu 


Thr 


Leu 










445 










450 










455 


Pro 


Ala 


Trp 


Leu 


His 


Ser 


Pro 


* * * 


Arg 


Leu 


Pro 


Leu 


Val 


His 


Pro 










460 










465 










470 


Phe 


He 


Glu 


Arg 


Pro 


Ala 


Leu 


Leu 


Gin 


Ser 


Ser 


Gly 


Leu 


Pro 


Pro 










475 










480 










485 


Ala 


Ala 


Arg 


Leu 


Ser 


Thr 


Arg 


Gly Leu 


Ser 


★ + ★ 


Asp 


Val 


Gin 


Gly 
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490 495 500 





Pro 


Arg 


Pro 


Ala 


Gly 


Thr 


Ala 


Ser 


Pro 


Asn 


Leu 


Gly 


Pro 


Trp 


Lys 












OUj 










510 








515 




Ser 


Pro 


rro 


Pro 


LI t r- 

nlS 


HIS 


* + * 


Ser 


Ala 


Leu 


Thr 


Leu 


Gly 


Phe 


His 












*\ o o 










525 








530 




Gly 


Pro 


His 


Ser 


Thr 


Ala 


Ser 


Pro 


Thr 


+ + * 


nxa 




Asp 


Leu 


o±y 












535 










540 








545 




Thr 


Lys 


Gly 


Gly 


Val 


Pro 


Arg 


Leu 


Leu 


* + + 


Leu 


Ser 


Arg 


Gly 


Ser 


10 










550 










555 










560 


Gly 


His 


Val 


Gin 


Gly 
565 


Gly 


Ala 


Gly 


Trp 


Pro 
570 


Gly 


Gly 


Ser 


Ala 


His 
575 




Pro 


Pro 


Ala 


Phe 


Pro 
580 


Gin 


Gly 


Val 


Leu 


Arg 
585 


Ser 


Lys 


He 


Leu 


Glu 
590 


15 


Gin 


Ser 


Asp 


Pro 


Ser 


Pro 


Lys 


Ala 


Leu 


Leu 


Ser 


Ala 


Gly 


Gin 


Cys 










595 










600 








605 




Gin 


Pro 


He 


Pro 


Gly 


His 


Leu 


Ala 


Pro 


Gly 


Asp 


Val 


Gly 


Pro 


Xxx 



610 615 620 



20 

(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 757 amino acids 
25 (B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: amino acid (Translation of Contiq 253538a) 

30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 



Glu Val Ala Gin 
15 

Asp Arg Lys Val 
30 

Gly Gly Ser Arg 
45 

Asp Pro Phe Val 
60 

Tyr Met Asn Ser 
75 

Ser Phe Glu Pro 
90 

Glu Leu Arg Ala 
105 

His Val Phe Phe 
120 

Gly Ala Ala Trp 
135 

Pro Phe Leu Leu 
150 

Gin Ala Gly Trp 
165 

Arg Lys Pro Lys 
180 

His Leu Lys Gly 
195 

Gin His His Ala 
210 

Asn Met Leu His 



35 



40 



45 



50 



55 



60 



Gin 


Gly 


Pro 


Thr 


Pro 


Arg 


Tyr 


Phe 


Thr 


Trp 


Asp 


1 








5 










10 




Arg 


Ser 


Gly 


Cys 


Glu 


Glu 


Arg 


Trp 


Leu 


Val 


He 










20 










25 




Tyr 


Asn 


He 


Ser 


Glu 


Phe 


Thr 


Arg 


Arg 


His 


Pro 










35 










40 




Val 


He 


Ser 


His 


Tyr 


Ala 


Gly 


Gin 


Asp 


Ala 


Thr 










50 










55 




Ala 


Phe 


His 


He 


Asn 


Lys 


Gly 


Leu 


Val 


Lys 


Lys 










65 










70 


Leu 


Leu 


He 


Gly 


Glu 


Leu 


Ser 


Pro 


Glu 


Gin 


Pro 










80 










85 




Thr 


Lys 


Asn 


Lys 


Glu 


Leu 


Thr 


Asp 


Glu 


Phe 


Arg 










95 










100 


Thr 


Val 


Glu 


Arg 


Met 


Gly 


Leu 


Met 


Lys 


Ala 


Asn 










110 










115 




Leu 


Leu 


Tyr 


Leu 


Leu 


His 


He 


Leu 


Leu 


Leu 


Asp 










125 










130 


Leu 


Thr 


Leu 


Trp 


Val 


Phe 


Gly 


Thr 


Ser 


Phe 


Leu 










140 










145 




Cys 


Ala 


Val 


Leu 


Leu 


Ser 


Ala 


Val 


Gin 


Gin 


Ala 










155 










160 




Leu 


Gin 


His 


Asp 


Tyr 


Gly 


His 


Leu 


Ser 


Val 


Tyr 










170 










175 


Trp 


Asn 


His 


Leu 


Val 


His 


Lys 


Phe 


Val 


He 


Gly 










185 










190 


Ala 


Ser 


Ala 


Asn 


Trp 


Trp 


Asn 


His 


Arg 


His 


Phe 










200 










205 




Lys 


Pro 


Asn 


He 


Phe 


His 


Lys 


Asp 


Pro 


Asp 


Val 
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215 










220 










225 




Val 


Phe 


Val 


Leu 


Gly 


Glu 


Trp 


Gin 


Pro 


He 


Glu 


Tyr 


Gly 


Lys 


Lys 












230 










235 










240 




Lys 


Leu 


Lys 


Tyr 


Leu 


Pro 


Tyr 


Asn 


His 


Gin 


His 


Glu 


Tyr 


Phe 


Phe 


5 










245 










250 










255 




Leu 


lie 


Gly 


Pro 


Pro 


Leu 


Leu 


lie 


Pro 


Met 


Tyr 


Phe 


Gin 


Tyr 


Gin 












260 










265 










270 




lie 


lie 


Met 


Thr 


Met 


lie 


Val 


His 


Lys 


Asn 


Trp 


Val 


Asp 


Leu 


Ala 












275 










280 










285 


10 


Trp 


Ala 


Val 


Ser 


Tyr 


Tyr 


lie 


Arg 


Phe 


Phe 


He 


Thr 


Tyr 


He 


Pro 












290 










295 










300 




Phe 


Tyr 


Gly 


lie 


Leu 


Gly 


Ala 


Leu 


Leu 


Phe 


Leu 


Asn 


Phe 


He 


Arg 












305 










310 










315 




Phe 


Leu 


Glu 


Ser 


His 


Trp 


Phe 


Val 


Trp 


Val 


Thr 


Gin 


Met 


Asn 


His 


15 










320 










325 










330 




lie 


Val 


Met 


Glu 


lie 


Asp 


Gin 


Glu 


Ala 


Tyr 


Arg 


Asp 


Trp 


Phe 


Ser 












335 










340 










345 




Ser 


Gin 


Leu 


Thr 


Ala 


Thr 


Cys 


Asn 


Val 


Glu 


Gin 


Ser 


Phe 


Phe 


Asn 


20 










350 










355 










360 


Asp 


Trp 


Phe 


Ser 


Gly 


His 


Leu 


Asn 


Phe 


Gin 


He 


Glu 


His 


His 


Leu 












365 










370 










375 




Phe 


Pro 


Thr 


Met 


Pro 


Arg 


His 


Asn 


Leu 


His 


Lys 


He 


Ala 


Pro 


Leu 












380 










385 










390 


25 


Val 


Lys 


Ser 


Leu 


Cys 


Ala 


Lys 


His 


Gly 


He 


Glu 


Tyr 


Gin 


Glu 


Lys 










400 










405 










410 




Pro 


Leu 


Leu 


Arg 


Ala 


Leu 


Leu 


Asp 


He 


He 


Arg 


Ser 


Leu 


Lys 


Lys 












415 










420 










425 




Ser 


Gly 


Lys 


Leu 


Trp 


Leu 


Asp 


Ala 


Tyr 


Leu 


His 


Lys 


* + * 


Ser 


His 


30 










430 










435 










440 


Ser 


Pro 


Arg 


Asp 


Thr 


Val 


Gly 


Lys 


Gly 


Cys 


Arg 


Trp 


Gly 


Asp 


Gly 












445 










450 










455 




Gin Arg Asn Asp 


Gly 


Leu 


Leu 


Phe 


* * * 


Gly Val 


Ser 


Glu 


Arg 


Leu 












460 










465 










470 


35 


Val 


Tyr 


Ala 


Leu 


Leu 


Thr 


Asp 


Pro 


Met 


Leu 


Asp 


Leu 


Ser 


Pro 


Phe 










475 










480 








485 




Leu 


Leu 


Ser 


Phe 


Phe 


Ser 


Ser 


His 


Leu 


Pro 


His 


Ser 


Thr 


Leu 


Pro 












490 










495 










500 




Ser 


Trp 


Asp 


Leu 


Pro 


Ser 


Leu 


Ser 


Arg 


Gin 


Pro 


Ser 


Ala 


Met 


Ala 


40 










c. r\ c 

505 










510 










515 


Leu 


Pro 


Val 


Pro 


Pro 


Ser 


Pro 


Phe 


Phe 


Gin 


Gly 


Ala 


Glu 


Arg 


Trp 












520 










525 










530 




Pro 


Pro 


Gly 


Val 


Ala 


Leu 


Ser 


Tyr 


Leu 


His 


Ser 


Leu 


Pro 


Leu 


Lys 












535 










540 










545 


45 


Met 


Gly Gly Asp 


Gin 


Arg 


Ser 


Met 


Gly 


Leu 


Ala 


Cys 


Glu 


Ser 


Pro 










550 










555 








560 




Leu 


Ala 


Ala 


Trp 


Ser 


Leu 


Gly 


He 


Thr 


Pro 


Ala 


Leu 


Val 


Leu 


Gin 












565 










570 










575 




Met 


Leu 


Leu Gly 


Phe 


lie 


Gly Ala 


Gly 


Pro 


Ser 


Arg 


Ala 


Gly 


Pro 


50 










580 










585 










590 


Leu 


Thr 


Leu 


Pro 


Ala 


Trp 


Leu 


His 


Ser 


Pro 


★ * ★ 


Arg 


Leu 


Pro 


Leu 












595 










600 








605 




Val 


His 


Pro 


Phe 


lie 


Glu 


Arg 


Pro 


Ala 


Leu 


Leu 


Gin 


Ser 


Ser 


Gly 












<3iU 










615 










620 


55 


Leu 


Pro 


Pro 


Ala 


Ala 


Arg 


Leu 


Ser 


Thr 


Arg 


Gly 


Leu 


Ser 


★ * * 


Asp 










625 










630 








635 




Val 


Gin 


Gly 


Pro 


Arg 


Pro 


Ala 


Gly 


Thr 


Ala 


Ser 


Pro 


Asn 


Leu 


Gly 












640 










645 










650 




Pro 


Trp 


Lys 


Ser 


Pro 


Pro 


Pro 


His 


His 


★ ★ * 


Ser 


Ala 


Leu 


Thr 


Leu 


60 










655 










660 










665 


Gly 


Phe 


His 


Gly 


Pro 


His 


Ser 


Thr 


Ala 


Ser 


Pro 


Thr 


+ * ★ 


Ala 


Cys 












670 










675 










680 




Asp 


Leu 


Gly 


Thr 


Lys 


Gly Gly Val 


Pro 


Arg 


Leu 


Leu 


★ * ★ 


Leu 


Ser 
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685 








Arg 


Gly 


Ser 


Gly 


His 
700 


Val 


Gin 




Ser 


Ala 


His 


Pro 


Pro 


Ala 


Phe 


5 










715 








He 


Leu 


Glu 


Gin 


Ser 
730 


Asp 


Pro 




Gly 


Gin 


Cys 


Gin 


Pro 


He 


Pro 


10 










745 






Gly 


Pro 


Xxx 















690 






695 


Gly 


Gly Ala 


Gly 


Trp Pro 


Gly Gly 






705 






710 


Pro 


Gin 


Gly 


Val 


Leu Arg 


Ser Lys 






720 






725 


Ser 


Pro 


Lys 


Ala 


Leu Leu 


Ser Ala 






735 






740 


Gly 


His 


Leu 


Ala 


Pro Gly 


Asp Val 






750 






755 



(2) INFORMATION FOR SEQ ID NO: 45: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 6 nucleic acids 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 



20 



50 



(ii) MOLECULE TYPE: nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 



25 CGTATGTCAC TCCATTCCAA ACTCGTTCAT GGTATCATAA ATATCAACAC ATTTACGCTC 60 

CACTCCTCTA TGGTATTTAC ACACTCAAAT ATCGTACTCA AGATTGGGAA GCTTTTGTAA 120 

AGGATGGTAA AAATGGTGCA ATTCGTGTTA GTGTCGCCAC AAATTTCGAT AAGGCCGCTT 180 

ACGTCATTGG TAAATTGTCT TTTGTTTTCT TCCGTTTCAT CCTTCCACTC CGTTATCATA 240 

GCTTTACAGA TTTAATTTGT TATTTCCTCA TTGCTGAATT CGTCTTTGGT TGGTATCTCA 300 

30 CAATTAATTT CCAAGTTAGT CATGTCGCTG AAGATCTCAA ATTCTTTGCT ACCCCTGAAA 3 60 

GACCAGATGA ACCATCTCAA ATCAATGAAG ATTGGGCAAT CCTTCAACTT AAAACTACTC 420 

AAGATTATGG TCATGGTTCA CTCCTTTGTA CCTTTTTTAG TGGTTCTTTA AATCATCAAG 480 

TTGTTCATCA TTTATTCCCA TCAATTGCTC AAGATTTCTA CCCACAACTT GTACCAATTG 540 

TAAAAGAAGT TTGTAAAGAA CATAACATTA CTTACCACAT TAAACCAAAC TTCACTGAAG 600 

35 CTATTATGTC ACACATTAAT TACCTTTACA AAATGGGTAA TGATCCAGAT TATGTTAAAA 660 

AACCATTAGC CT CAAAAG AT GATTAAATGA AATAACTTAA AAACCAATTA TTTACTTTTG 720 

ACAAACAGTA ATATTAATAA ATACAA 74 6 

40 (2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 227 amino acids 

(B) TYPE: amino acid 

45 (C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 



Tyr Val Thr Pro Phe Gin Thr Arg Ser Trp Tyr His Lys Tyr Gin 

1 5 10 15 

His lie Tyr Ala Pro Leu Leu Tyr Gly He Tyr Thr Leu Lys Tyr 

55 20 25 30 

Arg Thr Gin Asp Trp Glu Ala Phe Val Lys Asp Gly Lys Asn Gly 

35 40 45 

Ala He Arg Val Ser Val Ala Thr Asn Phe Asp Lys Ala Ala Tyr 

OU val He Gly Lys Leu Ser Phe Val Phe Phe Arg Phe He Leu Pro 

65 70 75 

Leu Arg Tyr His Ser Phe Thr Asp Leu He Cys Tyr Phe Leu He 

80 85 90 

Ala Glu Phe Val Phe Gly Trp Tyr Leu Thr He Asn Phe Gin Val 

95 100 105 



65 
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Ser His 


Val 


Ala 


Glu Asp 
110 


Leu 




Pro Asp 


Glu 


Pro 


Ser Gin 
125 


He 


5 


Leu Lys 


Thr 


Thr 


Gin Asp 
140 


Tyr 




Phe Phe 


Ser 


Gly 


Ser Leu 
155 


Asn 




Pro Ser 


lie 


Ala 


Gin Asp 


Phe 


10 








170 






Lys Glu 


Val 


Cys 


Lys Glu 
185 


His 




Asn Phe 


Thr 


Glu 


Ala He 
200 


Met 


15 


Met Gly Asn 


Asp 


Pro Asp Tyr 










215 






Asp Asp 


* * ★ 









Lys 


Phe 


Phe 
115 


Ala 


Thr 


Pro 


Glu 


Arg 
120 


Asn 


Glu 


Asp 
130 


Trp 


Ala 


He 


Leu 


Gin 
135 


Gly 


His 


Gly 
145 


Ser 


Leu 


Leu 


Cys 


Thr 
150 


His 


Gin 


Val 
160 


Val 


His 


His 


Leu 


Phe 
165 


Tyr 


Pro 


Gin 
175 


Leu 


Val 


Pro 


He 


Val 
180 


Asn 


He 


Thr 
190 


Tyr 


His 


He 


Lys 


Pro 
195 


Ser 


His 


He 
205 


Asn 


Tyr 


Leu 


Tyr 


Lys 
210 


Val 


Lys 


Lys 
220 


Pro 


Leu 


Ala 


Ser 


Lys 
225 



20 (2) INFORMATION FOR SEQ ID NO 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 94 nucleic acids 

(B) TYPE: nucleic acid 

25 (C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: nucleic acid 

30 (xi) SEQUENCE DESCRIPTION : SEQ ID NO: 47: 



TTTTGGAAGG NTCCAAGTTN ACCACGGANT NGGCAAGTTN ACGGGGCGGA AANCGGTTTT 60 

CCCCCCAAGC CTTTTGTCGA CTGGTTCTGT GGTGGCTTCC AGTACCAAGT CGACCACCAC 120 

35 TTATTCCCCA GCCTGCCCCG ACACAATCTG GCCAAGACAC ACGCACTGGT CGAATCGTTC 180 

TGCAAGGAGT GGGGTGTCCA GTACCACGAA GCCGACCTCG TGGACGGGAC CATGGAAGTC 240 

TTGCACCATT TGGGCAGCGT GGCCGGCGAA TTCGTCGTGG ATTTTGTACG CGACGGACCC 300 

GCCATGTAAT CGTCGTTCGT GACGATGCAA GGGTTCACGC ACATCTACAC ACACTCACTC 360 

ACACAACTAG TGTAACTCGT ATAGAATTCG GTGTCGACCT GGACCTTGTT TGACTGGTTG 420 

40 GGGATAGGGT AGGTAGGCGG ACGCGTGGGT CGNCCCCGGG AATTCTGTGA CCGGTACCTG 4 80 

GCCCGCGTNA AAGT 494 



45 (2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 87 amino acids 

(B) TYPE: amino acid 

50 (C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 



Phe 


Trp 


Lys Xxx 


Pro 


Ser 


Xxx 


Pro Arg 


Xxx 


Xxx 


Gin 


Val 


Xxx Gly 


1 






5 










10 








15 


Ala 


Glu 


Xxx Gly 


Phe 


Pro 


Pro 


Lys 


Pro 


Phe Val Asp 


Trp 


Phe Cys 








20 










25 








30 


Gly Gly 


Phe Gin 


Tyr 


Gin 


Val 


Asp 


His 


His 


Leu 


Phe 


Pro 


Ser Leu 








35 










40 








45 


Pro 


Arg 


His Asn 


Leu 


Ala 


Lys 


Thr 


His 


Ala 


Leu 


Val 


Glu 


Ser Phe 








50 










55 








60 


Cys 


Lys 


Glu Trp Gly Val 


Gin 


Tyr 


His 


Glu 


Ala 


Asp 


Leu Val Asp 








65 










70 








75 
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10 



20 



55 



60 



65 



Gly Thr Met Glu Val Leu His His Leu Gly Ser Val Ala Gly Glu 
65 70 75 

Phe Val Val Asp Phe Val Arg Asp Gly Pro Ala Met 
80 85 



(2) INFORMATION FOR SEQ ID NO: 49: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 520 nucleic acids 
15 (B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 



GGATGGAGTT CGTCTGGATC GCTGTGCGCT ACGCGACGTG GTTTAAGCGT CATGGGTGCG 60 

25 CTTGGGTACA CGCCGGGGCA GTCGTTGGGC ATGTACTTGT GCGCCTTTGG TCTCGGCTGC 120 

ATTTACATTT TTCTGCAGTT CGCCGTAAGT CACACCCATT TGCCCGTGAG CAACCCGGAG 180 

GATCAGCTGC ATTGGCTCGA GTACGCGCGG ACCACACTGT GAACAT CAGC ACCAAGTCGT 240 

GGTTTGTCAC ATGGTGGATG TCGAACCTCA ACTTTCAGAT CGAGCACCAC CTTTTCCCCA -300 

CGGCGCCCCA GTTCCGTTTC AAGG AG AT C A GCCCGCGCGT CGAGGCCCTC TTCAAGCGCC 360 

30 ACGGTCTCCC TTACTACGAC ATGCCCTACA CGAGCGCCGT CTCCACCACC TTTGCCAACC 420 

TCTACTCCGT CGGCCATTCC GTCGGCGACG CCAAGCGCGA CTAGCCTCTT TTCCTAGACC 4 80 

TTAATTCCCC ACCCCACCCC ATGTTCTGTC TTCCTCCCGC 520 

35 (2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 153 amino acids 

(B) TYPE: amino acid 

40 (C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 



50 



Met 


Glu 


Phe 


Val 


Trp 


He 


Ala 


Val 


Arg 


Tyr 


Ala 


Thr 


Trp 


Phe 


Lys 


1 








5 










10 










15 


Arg 


His 


Gly 


Cys 


Ala 


Trp 


Val 


His 


Ala 


Gly 


Ala 


Val 


Val 


Gly 


His 










20 










25 








30 


Val 


Leu 


Val 


Arg 


Leu 


Trp 


Ser 


Arg 


Leu 


His 


Leu 


His 


Phe 


Ser 


Ala 










35 










40 










45 


Val 


Arg 


Arg 


Lys 


Ser 


His 


Pro 


Phe 


Ala 


Arg 


Glu 


Gin 


Pro Gly Gly 










50 










55 










60 


Ser 


Ala 


Ala 


Leu 


Ala 


Arg 


Val 


Arg 


Ala 


Asp 


His 


Thr 


Val 


Asn 


He 


Ser 








65 










70 










75 


Thr 


Lys 


Ser 


Trp 


Phe 


Val 


Thr 


Trp 


Trp 


Met 


Ser 


Asn 


Leu 


Asn 










80 










85 










90 


Phe 


Gin 


He 


Glu 


His 


His 


Leu 


Phe 


Pro 


Thr 


Ala 


Pro 


Gin 


Phe 


Arg 


Phe 








95 










100 










105 


Lys 


Glu 


He 


Ser 


Pro 


Arg 


Val 


Glu 


Ala 


Leu 


Phe 


Lys 


Arg 


His 


Gly 








110 










115 






120 


Leu 


Pro 


Tyr 


Tyr 


Asp 


Met 


Pro 


Tyr 


Thr 


Ser 


Ala 


Val 


Ser 


Thr 


Thr 


Phe 






125 










130 










135 


Ala 


Asn 


Leu 


Tyr 


Ser 


Val 


Gly 


His 


Ser 


Val 


Gly Asp Ala 
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140 145 150 

Lys Arg Asp 



(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 429 nucleic acids 
10 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 



15 



(ii) MOLECULE TYPE: nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:51: 



ACGCGTCCGC CCACGCGTCC GCCGCGAGCA ACTCATCAAG GAAGGCTACT TTGACCCCTC 60 

20 GCTCCCGCAC ATGACGTACC GCGTGGTCGA GATTGTTGTT CTCTTCGTGC TTTCCTTTTG 120 

GCTGATGGGT CAGTCTTCAC CCCTCGCGCT CGCTCTCGGC ATTGTCGTCA GCGGCATCTC 180 

TCAGGGTCGC TGCGGCTGGG TAATGCATGA GATGGGCCAT GGGTCGTTCA CTGGTGTCAT 24 0 

TTGGCTTGAC GACCGGTTGT GCGAGTTCTT TTACGGCGTT GGTTGTGGCA TGAGCGGTCA 300 

T TACT GG AAA AACCAGCACA GCAAACACCA CGCAGCGCCA AACCGGCTCG AGCACGATGT 360 

25 AGATCTCAAC ACCTTGCCAT TGGTGGCCTT CAACGAGCGC GTCGTGCGCA AGGTCCGACC 420 



30 



(2) INFORMATION FOR SEQ ID NO: 52: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 125 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 
35 <D) TOPOLOGY: linear 



40 



(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 





Arg 


Val Arg 


Pro 


Arg 


Val 


Arg 


Arg 


Glu 


Gin 


Leu 


He 


Lys 


Glu Gly 




1 






5 










10 








15 




Tyr 


Phe Asp 


Pro 


Ser 


Leu 


Pro 


His 


Met 


Thr 


Tyr 


Arg 


Val 


Val Glu 


45 








20 










25 








30 




He 


Val Val 


Leu 


Phe 
35 


Val 


Leu 


Ser 


Phe 


Trp 
40 


Leu 


Met 


Gly 


Gin Ser 
45 




Ser 


Pro Leu 


Ala 


Leu 


Ala 


Leu 


Gly 


He 


Val 


Val 


Ser 


Gly 


He Ser 


50 








50 










55 








60 


Gin Gly Arg 


Cys 


Gly 


Trp 


Val 


Met 


His 


Glu 


Met 


Gly 


His 


Gly Ser 










65 










70 








75 




Phe 


Thr Gly 


Val 


He 
65 


Trp 


Leu 


Asp 


Asp 


Arg 
70 


Leu 


Cys 


Glu 


Phe Phe 
75 


55 


Tyr 


Gly Val 


Gly 


Cys 


Gly 


Met 


Ser 


Gly 


His 


Tyr 


Trp 


Lys 


Asn Gin 








80 










85 








90 




His 


Ser Lys 


His 


His 
95 


Ala 


Ala 


Pro 


Asn 


Arg 
100 


Leu 


Glu 


His 


Asp Val 
105 




Asp 


Leu Asn 


Thr 


Leu 


Pro 


Leu 


Val 


Ala 


Phe 


Asn 


Glu Arg 


Val Val 


60 








110 










115 








120 


Arg 


Lys Val 


Arg 


Pro 
125 
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What is claimed is : 

1 . A nucleic acid construct comprising: 

One or more nucleotide sequences depicted in a SEQ ID NO: selected from 
the group consisting of SEQ ID NO:l, SEQ ID NO:3 and SEQ ID NO:5, 
5 wherein said one or more nucleotide sequences is linked to a heterologous 

nucleotide sequence. 

2. A nucleic acid construct comprising: 

One or more nucleotide sequences depicted in a SEQ ID NO: selected from 
1 0 the group consisting of SEQ ID NO: 1 , SEQ ID NO:3 and SEQ ID NO:5, 

wherein said one or more nucleotide sequences is operably associated with an 
expression control sequence functional in a plant cell. 

3. The nucleic acid construct according to claim 2, wherein said nucleotide 
1 5 sequence has an average A + T content of less than about 60%. 

4. The nucleic acid construct according to claim 2, wherein said nucleotide 
sequence is derived from a fungus. 

20 5. The nucleic acid construct according to claim 4, wherein said fungus is of 

the genus Mortierella 

6. The nucleic acid construct according to claim 5, wherein said fungus is of 
the species alpina. 



25 



7. A nucleic acid construct comprising: 

A nucleotide sequence which encodes a polypeptide comprising an amino 
acid sequence depicted in SEQ ID NO:2, wherein said nucleotide sequence is 
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operably associated with a transcription or an expression control sequence 
function in a plant cell, wherein said nucleotide sequence encodes a functionally 
active polypeptide which desaturates a fatty acid molecule at carbon 6 from the 
carboxyl end of said fatty acid molecule. 

5 

8. A nucleic acid construct comprising: 

A nucleotide sequence which encodes a polypeptide comprising an 
amino acid sequence depicted in SEQ ID NO:4, wherein said nucleotide 
sequence is operably associated with a transcription or an expression control 
10 sequence functional in a plant cell, wherein said nucleotide sequence encodes a 

functionally active polypeptide which desaturates a fatty acid molecule at 
carbon 12 from the carboxyl end of said fatty acid molecule. 



9. A nucleic acid construct comprising: 

15 A nucleotide sequence which encodes a polypeptide comprising an 

amino acid sequence depicted in SEQ ID NO:6, wherein said nucleotide 
sequence is operably associated with a transcription or an expression control 
sequence function in a plant cell, wherein said nucleotide sequence encodes a 
functionally active polypeptide which desaturates a fatty acid molecule at 

20 carbon 5 from the carboxyl end of said fatty acid moleculle. 



10. A nucleic acid construct comprising: 

at least one nucleotide sequence which encodes a functionally active 
desaturase having an amino acid sequence depicted in a SEQ ID NO: selected 
25 from the group consisting of SEQ ID NO:2, SEQ ID NO:4 and SEQ ID NO:6, 

wherein said nucleotide sequence is operably associated with a promoter 
functional in a plant cell. 
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11. The nucleic acid construct according to claim 10, wherein said plant cell is a 
seed cell. 

12. The nucleic acid construct according to claim 11, wherein said seed cell is 
5 an embryo cell. 



13. A recombinant plant cell comprising: 

At least one copy of a DNA sequence which encodes at least one 
functionally active Mortierella alpina fatty acid desaturase which results in the 

10 production of a polyunsaturated fatty acid, wherein said fatty acid desaturase 

has an amino acid sequence as depicted in a SEQ ID NO: selected from the 
group consisting of SEQ ID NO:2, SEQ ID NO:4, and SEQ ID NO:6, wherein 
said cell was transformed with a vector comprising said DNA sequence, and 
wherein said DNA sequence is operably associated with an expression control 

15 sequence. 



20 



14. The recombinant plant cell of claim 13, wherein said polyunsaturated fatty 
acid is selected from the group consisting of LA, ARA, GLA, DGLA, SDA 
and EPA. 



15. The recombinant plant cell of claim 13, wherein said recombinant plant cell 
is enriched in a fatty acid selected from the group consisting of 18: 1, 18:2, 
18:3 and 18:4. 



25 16. The recombinant plant cell of claim 15, wherein said plant cell is selected 

from the group consisting of Brassica, soybean, safflower, corn, flax, and 
sunflower. 
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17. The recombinant plant cell according to claim 16, wherein said expression 
control sequence is endogenous to said plant cell. 

18. One or more plant oils expressed by said recombinant plant cell of claim 16. 

5 

19. A method for obtaining altered long chain polyunsaturated fatty acid 
biosynthesis comprising the steps of: 

growing a plant having cells which contain a transgene encoding a 
transgene expression product which desaturates a fatty acid molecule at carbon 
10 5 from the carboxyl end of said fatty acid molecule, wherein said transgene is 

operably associated with an expression control sequence, under conditions 
whereby said transgene is expressed, whereby long chain polyunsaturated fatty 
acid biosynthesis in said cells is altered. 

15 20. A method for obtaining altered long chain polyunsaturated fatty acid 

biosynthesis comprising the steps of: 

growing a plant having cells which contain one or more transgenes, 
derived from a fungus or algae, which encodes a transgene expression product 
which desaturates a fatty acid molecule at a carbon selected from the group 
20 consisting of carbon 5, carbon 6 and carbon 12 from the carboxyl end of said 

fatty acid molecule, wherein said one or more transgenes is operably associated 
with an expression control sequence, under conditions whereby said one or 
more transgenes is expressed, whereby long chain polyunsaturated fatty acid 
biosynthesis in said cells is altered. 

25 

21. The method according to claims 19 or 20, wherein said long chain 
polyunsaturated fatty acid is selected from the group consisting of LA, ARA, 
GLA, DGLA, SDA and EPA. 
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22. A plant oil or fraction thereof produced according to the method of claims 
19 or 20. 

23. A method of treating or preventing malnutrition comprising administering 

5 said plant oil of claim 22 to a patient in need of said treatment or prevention 

in an amount sufficient to effect said treatment or prevention. 

24. A pharmaceutical composition comprising said plant oil or fraction of claim 
22 and a pharmaceutically acceptable carrier. 

10 

25. The pharmaceutical composition of claim 24, wherein said pharmaceutical 
composition is in the form of a solid or a liquid. 

26. The pharmaceutical composition of claim 25, wherein said pharmaceutical 
1 5 composition is in a capsule or tablet form. 

27. The pharmaceutical composition of claim 24 further comprising at least one 
nutrient selected from the group consisting of a vitamin, a mineral, a 
carbohydrate, a sugar, an amino acid, a free fatty acid, a phospholipid, an 

-0 antioxidant, and a phenolic compound. 

28. A nutritional formula comprising said plant oil or fraction thereof of claim 
22. 

5 29. The nutritional formula of claim 28, wherein said nutritional formula is 

selected from the group consisting of an infant formula, a dietary 
supplement, and a dietary substitute. 



-179- 



PCT/US98/07421 



30. The nutritional formula of claim 29, wherein said infant formula, dietary 
supplement or dietary supplement is in the form of a liquid or a solid. 

3 1 . An infant formula comprising said plant oil or fraction thereof of claim 22. 

5 

32. The infant formula of claim 31 further comprising at least one macronutrient 
selected from the group consisting of coconut oil, soy oil, canola oil, mono- 
and diglycerides, glucose, edible lactose, electrodialysed whey, 
electrodialysed skim milk, milk whey, soy protein, and other protein 

10 hydrolysates. 

33. The infant formula of claim 32 further comprising at least one vitamin 
selected from the group consisting of Vitamins A, C, D, E, and B complex; 
and at least one mineral selected from the group consisting of calcium, 

15 magnesium, zinc, manganese, sodium, potassium, phosphorus, copper, 

chloride, iodine, selenium, and iron. 

34. A dietary supplement comprising said plant oil or fraction thereof of claim 
22. 

20 

35. The dietary supplement of claim 34 further comprising at least one 
macronutrient selected from the group consisting of coconut oil, soy oil, 
canola oil, mono- and diglycerides, glucose, edible lactose, electrodialysed 
whey, electrodialysed skim milk, milk whey, soy protein, and other protein 

25 hydrolysates. 

36. The dietary supplement of claim 35 further comprising at least one vitamin 
selected from the group consisting of Vitamins A, C, D, E, and B complex; 
and at least one mineral selected from the group consisting of calcium, 
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magnesium, zinc, manganese, sodium, potassium, phosphorus, copper, 
chloride, iodine, selenium, and iron. 

37. The dietary supplement of claim 34 or claim 36, wherein said dietary 
5 supplement is administered to a human or an animal. 

38. A dietary substitute comprising said plant oil or fraction thereof of claim 22. 

39. The dietary substitute of claim 38 further comprising at least one 

10 macronutrient selected from the group consisting of coconut oil, soy oil, 

canola oil, mono- and diglycerides, glucose, edible lactose, electrodialysed 
whey, electrodialysed skim milk, milk whey, soy protein, and other protein 
hydrolysates. 

15 40. The dietary substitute of claim 39 further comprising at least one vitamin 

selected from the group consisting of Vitamins A, C, D, E, and B complex; 
and at least one mineral selected from the group consisting of calcium, 
magnesium, zinc, manganese, sodium, potassium, phosphorus, copper, 
chloride, iodine, selenium, and iron. 

20 

41. The dietary substitute of claim 38 or claim 40, wherein said dietary 
substitute is administered to a human or animal. 

42. A method of treating a patient having a condition caused by insuffient 

25 intake or production of polyunsaturated fatty acids comprising administering 

to said patient said dietary substitute of claim 38 or said dietary supplement 
of claim 34 in an amount sufficient to effect said treatment. 
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43. The method of claim 42, wherein said dietary substitute or said dietary 
supplement is administered enterally or parenterally. 

44. A cosmetic comprising said plant oil or fraction thereof of claim 22. 

5 

45. The cosmetic of claim 44, wherein said cosmetic is applied topically. 

46. The pharmaceutical composition of claim 24, wherein said pharmaceutical 
composition is administered to a human or an animal. 

10 

47. An animal feed comprising said plant oil or fraction thereof of claim 22. 

48. An isolated nucleotide sequence comprising the nucleotide sequence 
selected from the group consisting of SEQ ID NO:38 - SEQ ID NO:44 

15 wherein said nucleotide sequence is expressed in a plant cell. 

49. The method of claim 20 wherein said fungus is Mortierella species. 

50. The method of claim 49 wherein said fungus is Mortierella alpina. 

20 

51. An isolated nucleotide sequence selected from the group consisting of SEQ 
ID NO:49 - SEQ ID NO:50 wherein said sequence is expressed in a plant 
cell. 
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FastA Match of ma29 and contig 253538a 

SCORES Initl: 117 Initn: 225 Opt: 256 

Smith-Waterman score: 408; 27.0% identity in 441 aa overlap 

10 20 30 40 50 

ma29gcg.pep MGTDQGKT FTWEELAAHNTKDDLL1AIRGRVYDVTKFLSRHPGGVDTLLLGAGRDVT 

If I : : I : I : I I : : : : I 11(11 : : I I : I : I 

253538a QGPTPRYFTWDEVAQRSGCEERWLVIDRKVYNISEFTRRHPGGSRVISHYAGQDAT 

10 20 30 40 50 



ma29gcg.pep 



253538a 



60 70 80 90 100 110 

PVFEMYHAF-GAADAIMKKYYVGTLVSNELPIFPEPTVFHKTIKTRVEGYFTDRNIDPKN 

I ; J I ..L U;. .1.1 LJ...J, I I III .:: . I I : : : 

DPFVAFHINKGLVKKYMNSLLIG^ 



60 



70 



80 



90 



100 



110 



120 130 140 150 160 170 

ma29gcg.pep RPEIWGRYALIFGSLIASYYAQLFVPFWERTWLQWF-AIIMGFACAQVGLNPLHDASH 
: : : : I : : : I : : : I : : I : : I : : : : : I I : I I I : I 
253538a ANHVF — FLLYLLHILLLDGAAWLTLWVFGTSFLPFLLCAVLLSAVQAQAGWLQ-HDYGH 

120 130 140 150 160 170 

180 190 200 210 220 

ma29gcg.pep FSVTHNPTVWKI LGATHDF FNGASYLVWMYQHMLGHHPYTNIAGADPDVSTSE 

: I I : : I I : I :l I : : I I I I : : I : I I II MM: : 
253538a LSVYRKPK-WNHL — VHKFVIGHLKGASAbJWWNHRH- FQHHAKPNI FHKDPDVNMLHVFV 

180 190 200 210 220" 

230 240 250 260 270 280 

ma29gcg.pep PDVRRIKPNQKWF-VNHINQHMFV — PFLYGLLAFKVRIQDINILYFVKTNDAIRV 

• • j.j.j |{::::|: || : j • : | | • : : :•:: 

253538a LGEWQPIEYGKKKLKYLPYNHQHEYFFLIGPPLLIPMYFQYQI IMTMIVHKNWVDL 

230 240 250 260 270 280 

290 300 310 320 330 340 

ma29gcg.pep NPISTWHTVMFWGGKAFFVWYRLIVPLQYLPLGKVLLLFTVADMVSSYWLALTFQANHW 
: I : : : M : I : I : I I I : I I : : : : : I : I : : : I I I : I 

253538a AWAVSYYI RFFITY IPF-YGILG-ALLFLNFIRFLESHWFVWVTQMNHIV 

290 300 310 320 330 

350 360 370 380 390 

ma29gcg . pep EEVQWPLPDENG 1 1 QKDWAAMQVETT QDYAHDSHLWTSITGSLNYQAVHHLFPNVS 

I : I : : : Ml : I : : I I : : : I II I I I : I I I I I I : : 

253538a ME I DQEAY — RDWFSSQLTATCNVEQS FFND WFS — GHLNFQIEHHLFPTMP 

340 350 360 370 



400 410 420 430 440 

ma29gcg.pep QHHYPDILAIIKNTCSEYKVPYLVKDTFWQAFASHLEHLRVLGLRPKEEX 

: I : I : : I : I : : : : I I 
253538a RHNLHKIAPLVKSLCAKHGIEYQEKPLLRALLDIIRSLKKSGKLWLDAYLHKX 
380 390 400 410 420 430 
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FastA Match of ma524 and contig 253538a 



SCORES Initl: 231 Initn: 499 Opt: 401 

Smith-Waterman score: 620; 27.3% identity in 455 aa overlap 

10 20 30 40 50 59 

ma524acq pep MAAAPSWTFTRAEVLNAEALNEGK^ 

|: | || || ::::: -11 III" 11= I I I I I — I 

253538a QGPTPRYFTWDEV AQRSGCEERWLVIDRKVYNISEFTRRHPGGSRVISHY 

10 20 30 40 50 

60 70 80 90 100 110 

ma524gcg . pep VGKDGTD7FDTFHPEAAW — ETLANFYVGDIDE SDRDIKNDDFAAEVRKLRTLFQSL 

.:j.:|:Jl | :|( . .U:^-v4^ - J4-.? ^ .L.J/ .? . 

''253538a iiGQEATO^ 

60 70 80 90 100 110 

120 130 140 150 160 170 

ma524gcg.pep CTYDSSKAYYAFKVSFI^ 

| :::::::: : | : | : : : : | | | | : I I : I I : I III I I : 
2 53 53 8a GLMKANHtfFTLLYLL^^ 

120 130 140 150 160 

180 190 200 210 220 230 

ma52 4gcg . pep LHHQWQDRFWGDLFGAFLGGVC^ 

| :|:: |: | |: | :| |::||: :| III lh =111 = = : l 

2 5 3 53 8a GHLSVYRKPKWNHLVHKFVniGHLK^ ML 

170 180 190 200 210 220 

240 250 260 270 280 290 

ma524gcg.pep EHAI£MFSDVPDEELTRM^^^ 

Is: :::: | : ::: |: II === : : : | |: :: =1 

253538a -HVF-V^EWQPIEYGKKKrJCYLPYOTQHEYFFLIG VH 

230 240 250 260 270 

300 310 320 330 340 349 

ma524gcg.pep KPSGARVPISLVEQLSIAMHlftT[WYIATMFL»FIK — DPVNMLVYFLVSQAVCGNLLAIVFS 
I : ::|| ::|: :: :| :: |::: :::::: | : 

253538a K NWVDIAWAVSYYIRFFITYIPFYGILGALLFU^ 

280 290 300 310 320 

350 360 370 380 390 400 409 

ma524gcg.pep LNHNGMFVTSKEEAVDMDFFTK^ 

: | | | : : | | | : | : : | : : : | : : : I = I I : I I I : I I I I I I I h I I I I I 5 
253538a MNHIVMEI--DQEAYR-I^SSQLTATa^ 

330 340 350 360 370 380 

410 420 430 440 450 

ma524gcg . pep SKIQPAVETLCKKYNVRYHTTGM^ 

| | | | : : | | | : : : : | : : : : : : : : h = = I 
253538a HKIAPLVKSLCAKHGIEYQEKPIXRAIJ^ 

390 400 410 420 430 
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METHODS AND COMPOSITIONS FOR SYNTHESIS OF 
LONG CHAIN POLYUNSATURATED FATTY ACIDS IN PLANTS 

CROSS-REFERENCE TO RELATED APPLICATIONS 

This application is a continuation-in-part of USSN 08/834,655, filed 
5 April 11, 1997, and a continuation in part of USSN 08/833,610, filed April 11, 
1997, USSN 08/834,033 filed April 1 1, 1997 and USSN 08/956,985 filed 
October 24, 1997 which disclosures are incorporated herein by reference. 

INTRODUCTION 

Field of the Invention 

10 This invention relates to modulating levels of enzymes and/or enzyme 

components capable of altering the production of long chain polyunsaturated 
fatty acids (PUFAS) in a host plant. The invention is exemplified by the 
production of PUFAS in plants. 

Background 

15 Two main families of polyunsaturated fatty acids (PUFAs) are the co3 

fatty acids, exemplified by arachidonic acid, and the co6 fatty acids, exemplified 
by eicosapentaenoic acid. PUFAs are important components of the plasma 
membrane of the cell, where they may be found in such forms as phospholipids. 
PUFAs also serve as precursors to other molecules of importance in human 

20 beings and animals, including the prostacyclins, leukotrienes and 

prostaglandins. PUFAs are necessary for proper development, particularly in 
the developing infant brain, and for tissue formation and repair. 

Four major long chain PUFAs of importance include docosahexaenoic 
acid (DHA) and eicosapentaenoic acid (EPA), which are primarily found in 
25 different types of fish oil, gamma-linolenic acid (GLA), which is found in the 
seeds of a number of plants, including evening primrose {Oenothera biennis), 
borage (Borago officinalis) and black currants (Ribes nigrum), and stearidonic 
acid (SDA), which is found in marine oils and plant seeds. Both GLA and 
another important long chain PUFA, arachidonic acid (ARA), are found in 
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filamentous fungi. ARA can be purified from animal tissues including liver and 
adrenal gland. 

For DHA, a number of sources exist for commercial production 
including a variety of marine organisms, oils obtained from cold water marine 
5 fish, and egg yolk fractions. For ARA, microorganisms including the genera 
Mortierella, Entomophthora, Phytium and Porphyridium can be used for 
commercial production. Commercial sources of SDA include the genera 
Trichodesma and Echium. Commercial sources of GLA include evening 
primrose, black currants and borage. However, there are several disadvantages 
10 associated with commercial production of PUFAs from natural sources. Natural 
sources of PUFAs, such as animals and plants, tend to have highly 
heterogeneous oil compositions. The oils obtained from these sources therefore 
can require extensive purification to separate out one or more desired PUFAs or 
to produce an oil which is enriched in one or more PUFA. Natural sources also 
15 are subject to uncontrollable fluctuations in availability. Fish stocks may 
undergo natural variation or may be depleted by overfishing. Fish oils have 
unpleasant tastes and odors, which may be impossible to economically separate 
from the desired product, and can render such products unacceptable as food 
supplements. Animal oils, and particularly fish oils, can accumulate 
20 environmental pollutants. Weather and disease can cause fluctuation in yields 
from both fish and plant sources. Cropland available for production of alternate 
oil-producing crops is subject to competition from the steady expansion of 
human populations and the associated increased need for food production on the 
remaining arable land. Crops which do produce PUFAs, such as borage, have 
25 not been adapted to commercial growth and may not perform well in 

monoculture. Growth of such crops is thus not economically competitive where 
more profitable and better established crops can be grown. Large scale 
fermentation of organisms such as Mortierella is also expensive. Natural 
animal tissues contain low amounts of ARA and are difficult to process. 
30 Microorganisms such as Porphyridium and Mortierella are difficult to cultivate 
on a commercial scale. 
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Dietary supplements and pharmaceutical formulations containing 
PUFAs can retain the disadvantages of the PUFA source. Supplements such as 
fish oil capsules can contain low levels of the particular desired component and 
thus require large dosages. High dosages result in ingestion of high levels of 
5 undesired components, including contaminants. Care must be taken in 

providing fatty acid supplements, as overaddition may result in suppression of 
endogenous biosynthetic pathways and lead to competition with other necessary 
fatty acids in various lipid fractions in vivo, leading to undesirable results. For 
example, Eskimos having a diet high in ©3 fatty acids have an increased 
1 0 tendency to bleed (U.S. Pat. No. 4,874,603). Unpleasant tastes and odors of the 
supplements can make such regimens undesirable, and may inhibit compliance 
by the patient. 

A number of enzymes are involved in PUFA biosynthesis. Linoleic acid 
(LA, 18:2 A9, 12) is produced from oleic acid (18:1 A9) by a A12-desaturase. 

15 GLA (18:3 A6, 9, 12) is produced from linoleic acid (LA, 1 8:2 A9, 12) by a A6- 
desaturase. ARA (20:4 A5, 8, 1 1, 14) production from DGLA (20:3 A8, 1 1, 14) 
is catalyzed by a A5-desaturase. However, animals cannot desaturate beyond 
the A9 position and therefore cannot convert oleic acid (18:1 A9) into linoleic 
acid (18:2 A9, 12). Likewise, a-linolenic acid (ALA, 18:3 A9, 12, 15) cannot 

20 be synthesized by mammals. Other eukaryotes, including fungi and plants, have 
enzymes which desaturate at positions A21 and A 15. The major poly- 
unsaturated fatty acids of animals therefore are either derived from diet and/or 
from desaturation and elongation of linoleic acid (1 8:2 A9, 12) or oc-linolenic 
acid (18:3 A9, 12, 15). 

*5 Poly-unsaturated fatty acids are considered to be useful for nutritional, 

pharmaceutical, industrial, and other purposes. An expansive supply of poly- 
unsaturated fatty acids from natural sources and from chemical synthesis are not 
sufficient for commercial needs. Therefore it is of interest to obtain genetic 
material involved in PUFA biosynthesis from species that naturally produce 

*0 these fatty acids and to express the isolated material alone or in combination in 
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a heterologous system which can be manipulated to allow production of 
commercial quantities of PUFAS. 

The present invention is further directed to formulas, dietary 
supplements or dietary supplements in the form of a liquid or a solid containing 
5 the long chain fatty acids of the invention. These formulas and supplements 
may be administered to a human or an animal. 

The formulas and supplements of the invention may further comprise at 
least one macronutrient selected from the group consisting of coconut oil, soy 
oil, canola oil, mono- and diglycerides, glucose, edible lactose, electrodialysed 
10 whey, electrodialysed skim milk, milk whey, soy protein, and other protein 
hydrolysates. 

The formulas of the present invention may further include at least one 
vitamin selected from the group consisting of Vitamins A, C, D, E, and B 
complex; and at least one mineral selected from the group consisting of 
15 calcium, magnesium, zinc, manganese, sodium, potassium, phosphorus, copper, 
chloride, iodine, selenium, and iron. 

The present invention is further directed to a method of treating a patient 
having a condition caused by insuffient intake or production of polyunsaturated 
fatty acids comprising administering to the patient a dietary substitute of the 
20 invention in an amount sufficient to effect treatment of the patient. 

The present invention is further directed to cosmetic and pharmaceutical 
compositions of the material of the invention. 

The present invention is further directed to transgenic oils in 
pharmaceutical^ acceptable carriers. The present invention is further directed 
25 to nutritional supplements, cosmetic agents and infant formulae containing 
transgenic oils. 

The present invention is further directed to a method for obtaining 
altered long chain polyunsaturated fatty acid biosynthesis comprising the steps 
of: growing a microbe having cells which contain a transgene which encodes a 
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transgene expression product which desaturates a fatty acid molecule at carbon 
5,5 or 12 from the carboxyl end of said fatty acid molecule, wherein the 
trangene is operably associated with an expression control sequence, under 
conditions whereby the transgene is expressed, whereby long chain 
5 polyunsaturated fatty acid biosynthesis in the cells is altered. 

The present invention is further directed toward pharmaceutical 
compositions comprising at least one nutrient selected from the group consisting 
of a vitamin, a mineral, a carbohydrate, a sugar, an amino acid, a free fatty acid, 
a phospholipid, an antioxidant, and a phenolic compound. 

10 Relevant Literature 

Production of gamma-linolenic acid by a A6-desaturase is described in 
USPN 5,552,306 and USPN 5,614,393. Production of 8, 1 1-eicosadienoic acid 
using Mortierella alpina is disclosed in USPN 5,376,541. Production of 
docosahexaenoic acid by dinoflagellates is described in USPN 5,407,957. 

1 5 Cloning of a A6-desaturase from borage is described in PCT publication WO 
96/21022. Cloning of A9-desaturases is described in the published patent 
applications PCT WO 91/13972, EP 0 550 162 Al, EP 0 561 569 A2, EP 0 644 
263 A2, and EP 0 736 598 Al, and in USPN 5,057,419. Cloning of A12- 
desaturases from various organisms is described in PCT publication WO 

20 94/1 1516 and USPN 5,443,974. Cloning of A15-desaturases from various 

organisms is described in PCT publication WO 93/1 1245. A A6 palmitoyl-acyl 
carrier protein desaturase from Thumbergia alata and its expression in E. coli is 
described in USPN 5,614,400. Expression of a soybean stearyl-ACP desaturase 
in transgenic soybean embryos using a 35S promoter is disclosed in USPN 

25 5,443,974. 

SUMMARY OF THE INVENTION 

Novel compositions and methods are provided for preparation of poly- 
unsaturated long chain fatty acids and desaturases in plants and plant cells. The 
methods involve growing a host plant cell of interest transformed with an 
30 expression cassette functional in a host plant cell, the expression cassette 
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comprising a transcriptional and trans lational initiation regulatory region, joined 
in reading frame 5' to a DNA sequence encoding a desaturase polypeptide 
capable of modulating the production of PUFAs. Expression of the desaturase 
polypeptide provides for an alteration in the PUFA profile of host plant cells as 
a result of altered concentrations of enzymes involved in PUFA biosynthesis. 
Of particular interest is the selective control of PUFA production in plant tissues 
and/or plant parts such as leaves, roots, fruits and seeds. The invention finds 
use for example in the large scale production of DHA, EPA, ARA, and GL A 
and for modification of the fatty acid profile of edible plant tissues and/or plant 
parts. 

The present invention further includes a purified nucleotide sequence or 
polypeptide sequence that is substantially related or homologous to the 
nucleotide and peptide sequences presented in SEQ ID NO:l - SEQ ID NO:52. 
The present invention is further directed to methods of using the sequences 
presented in SEQ ID NO:l to SEQ ID NO:40 as probes to identify related 
sequences, as components of expression systems and as components of systems 
useful for producing transgenic oil. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows possible pathways for the synthesis of arachidonic acid 
20 (20:4 A5, 8, 1 1, 14) and stearidonic acid (18:4 A6, 9, 12, 15) from palmitic acid 
(Ci6) from a variety of organisms, including algae, Mortierella and humans. 
These PUFAs can serve as precursors to other molecules important for humans 
and other animals, including prostacyclins, leukotrienes, and prostaglandins, 
some of which are shown. 

25 Figure 2 shows possible pathways for production of PUFAs in addition 

to ARA, including EPA and DHA, again compiled from a variety of organisms. 

Figure 3A-E shows the DNA sequence (SEQ ID NO:l) of the 
Mortierella alpina A6 desaturase and the deduced amino acid sequence (SEQ 
ID NO:2). 

-6- 
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Figure 4 shows an alignment of the Mortierella alpina A6 desaturase 
amino acid sequence with other A6 desaturases and related sequences (SEQ ID 
NOS:7, 8,9, 10, 11, 12 and 13). 

Figure 5A-D shows the DNA sequence of the Mortierella alpina A12 
5 desaturase (SEQ ID NO:3) and the deduced amino acid sequence (SEQ ID 
NO:4) 

Figure 6 shows the deduced amino acid sequence (SEQ ID NO: 14) of 
the PCR fragment (see Example 1). 

Figure 7A-D shows the DNA sequence of the Mortierella alpina A5 
1 0 desaturase (SEQ ID NO:5). 

Figure 8 shows alignments of the protein sequence of the A5 desaturase 
(SEQ ID NO:6) with A6 desaturases and related sequences (SEQ ID NOS:15, 
16, 17, 18). 

Figure 9 shows alignments of the protein sequence of the Ma 29 and 
15 contig 253538a. 

Figure 10 shows alignments of the protein sequence of Ma 524 and 
contig 253538a. 

BRIEF DESCRIPTION OF THE SEQUENCE LISTINGS 
SEQ ID NO: 1 shows the DNA sequence of the Mortierella alpina A6 
20 desaturase. 

SEQ ID NO:2 shows the amino acid sequence of the Mortierella alpina 
A6 desaturase. 

SEQ ID NO:3 shows the DNA sequence of the Mortierella alpina A12 
desaturase. 

25 SEQ ID NO:4 shows the amino acid sequence of the Mortierella alpina 

A12 desaturase. 
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SEQ ID NO :5 shows the DNA sequence of the Mortierella alpina A5 
desaturase. 

SEQ ID NO:6 shows the amino acid sequence Mortierella alpina A5 
desaturase. 

5 SEQ ID NO:7 - SEQ ID NO: 1 3 show amino acid sequences that relate 

to Mortierella alpina A6 desaturase. 

SEQ ID NO: 14 shows an amino acid sequence of a PCR fragment of 
Example 1. 

SEQ ID NO: 15 - SEQ ID NO: 18 show amino acid sequences that relate 
10 to Mortierella alpina A5 and A6 desaturases. 

SEQ ID NO: 19 - SEQ ID NO:30 show PCR primer sequences. 

SEQ ID NO:31 - SEQ ID NO:37 show human nucleotide sequences. 

SEQ ID NO:38 - SEQ ID NO:44 show human peptide sequences. 

SEQ ID NO:45 - SEQ ID NO:46 show the nucleotide and amino acid 
1 5 sequence of a Dictyostelium discoideium desaturase. 

SEQ ID NO:47 - SEQ ID NO:50 show the nucleotide and deduced 
amino acid sequence of a Schizochytrium cDNA clone. 

DESCRIPTION OF THE PREFERRED EMBODIMENTS 

In order to ensure a complete understanding of the invention, the 
20 following definitions are provided: 

A5-Desaturase: A5 desaturase is an enzyme which introduces a double 
bond between carbons 5 and 6 from the carboxyl end of a fatty acid molecule. 

A6-Desaturase: A6-desaturase is an enzyme which introduces a double 
bond between carbons 6 and 7 from the carboxyl end of a fatty acid molecule. 

25 A9-Desaturase: A9-desaturase is an enzyme which introduces a double 

bond between carbons 9 and 10 from the carboxyl end of a fatty acid molecule. 
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A12-Desaturase: A12-desaturase is an enzyme which introduces a 
double bond between carbons 12 and 13 from the carboxyl end of a fatty acid 
molecule. 

Fatty Acids: Fatty acids are a class of compounds containing a long 
5 hydrocarbon chain and a terminal carboxylate group. Fatty acids include the 
following: 



Fatty Acid 


12:0 


lauric acid 




16:0 


palmitic acid 


- 


16:1 


palmitoleic acid 




18:0 


stearic acid 




18:1 


oleic acid 


A9-18:l 


18:2 A5,9 


taxoleic acid 


A5,9-18:2 


18:2 A6,9 


6,9-octadecadienoic acid 


A6,9-18:2 


18:2 


iinoleic acid 


A9,12-18:2 (LA) 


18:3 A6,9,12 


gamma-linolenic acid 


A6,9,12-18:3 (GLA) 


18:3 A5,9,12 


pinolenic acid 


A5,9,12-18:3 


18:3 


alpha-linolenic acid 


A9,12,15-18:3 (ALA) 


18:4 


stearidonic acid 


A6,9,12,15-18:4 (SDA) 


20:0 


Arachidic acid 




20:1 


Eicoscenic Acid 




22:0 


behehic acid 




22:1 


erucic acid 




22:2 


Docasadienoic acid 




20:4o)6 


arachidonic acid 


A5,8, 11,14-20:4 (ARA) 


20:3 G>6 


o>6-eicosatrienoic 
dihomo-gamma linolenic 


A8,l 1,14-20:3 (DGLA) 


20:5 0)3 


Eicosapentanoic 
(Timnodonic acid) 


A5,8,l 1,14,17-20:5 (EPA) 


20:3 o>3 


0)3-eicosatrienoic 


Al 1,16,17-20:3 


20:4 0)3 


0)3 -eicosatetraenoic 


A8, 11,14,17-20:4 


22:5 0)3 


Docosapentaenoic 


A7,10,13,16,19-22:5 (o)3DPA) 


22:6 0)3 


Docosahexaenoic 
(cervonic acid) 


A4,7,10,13,16,19-22:6 (DHA) 


24:0 


Lignoceric acid 
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Taking into account these definitions, the present invention is directed to novel 
DNA sequences, DNA constructs, methods and compositions are provided 
which permit modification of the poly-unsaturated long chain fatty acid content 
of plant cells. Plant cells are transformed with an expression cassette 

5 comprising a DNA encoding a polypeptide capable of increasing the amount of 
one or more PUF A in a plant cell. Desirably, integration constructs may be 
prepared which provide for integration of the expression cassette into the 
genome of a host cell. Host cells are manipulated to express a sense or 
antisense DNA encoding a polypeptide(s) that has desaturase activity. By 

1 0 "desaturase'* is intended a polypeptide which can desaturate one or more fatty 
acids to produce a mono- or poly-unsaturated fatty acid or precursor thereof of 
interest. By "polypeptide" is meant any chain of amino acids, regardless of 
length or post-translational modification, for example, glycosylation or 
phosphorylation. The substrate(s) for the expressed enryme may be produced 

15 by the host cell or may be exogenously supplied. 

To achieve expression in a host cell, the transformed DNA is operably 
associated with transcriptional and translational initiation and termination 
regulatory regions that are functional in the host cell. Constructs comprising the 
gene to be expressed can provide for integration into the genome of the host cell 

20 or can autonomously replicate in the host cell. For production of linoleic acid 
(LA), the expression cassettes generally used include a cassette which provides 
for A12 desaturase activity, particularly in a host cell which produces or can 
take up oleic acid. For production of ALA, the expression cassettes generally 
used include a cassette which provides for A15 or to 3 desaturase activity, 

25 particularly in a host cell which produces or can take up LA. For production of 
GL A or SD A, the expression cassettes generally used include a cassette which 
provides for A6 desaturase activity, particularly in a host cell which produces or 
can take up LA or ALA, respectively. Production of 006-type unsaturated fatty 
acids, such as LA or GLA, is favored in a plant capable of producing ALA by 

30 inhibiting the activity of a A15 or ©3 type desaturase; this is accomplished by 
providing an expression cassette for an antisense A15 or co3 transcript, or by 
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disrupting a A15 or ©3 desaturase gene. Similarly, production of LA or ALA is 
favored in a plant having A6 desaturase activity by providing an expression 
cassette for an antisense A6 transcript, or by disrupting a A6 desaturase gene. 
Production of oleic acid likewise is favored in a plant having A12 desaturase 
5 activity by providing an expression cassette for an antisense A12 transcript, or 
by disrupting a A12 desaturase gene. For production of ARA, the expression 
cassette generally used provides for A5 desaturase activity, particularly in a host 
cell which produces or can take up DGLA. Production of ©6-type unsaturated 
fatty acids, such as ARA, is favored in a plant capable of producing ALA by 
1 0 inhibiting the activity of a A15 or ©3 type desaturase; this is accomplished by 
providing an expression cassette for an antisense A15 or <d3 transcript, or by 
disrupting a A15 or ©3 desaturase gene. 

TRANSGENIC PLANT PRODUCTION OF FATTY ACIDS 

Transgenic plant production of PUFAs offers several advantages over 

1 5 purification from natural sources such as fish or plants. Production of fatty 
acids from recombinant plants provides the ability to alter the naturally 
occurring plant fatty acid profile by providing new synthetic pathways in the 
host or by suppressing undesired pathways, thereby increasing levels of desired 
PUFAs, or conjugated forms thereof, and decreasing levels of undesired 

20 PUFAs. Production of fatty acids in transgenic plants also offers the advantage 
that expression of desaturase genes in particular tissues and/or plant parts means 
that greatly increased levels of desired PUFAs in those tissues and/or parts can 
be achieved, making recovery from those tissues more economical. For 
example, the desired PUFAs can be expressed in seed; methods of isolating 

25 seed oils are well established. In addition to providing a source for purification 
of desired PUFAs, seed oil components can be manipulated through expression 
of desaturase genes, either alone or in combination with other genes such as 
elongases, to provide seed oils having a particular PUFA profile in concentrated 
form. The concentrated seed oils then can be added to animal milks and/or 

30 synthetic or semi-synthetic milks to serve as infant formulas where human 
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nursing is impossible or undesired, or in cases of malnourishment or disease in 
both adults and infants. 

For production of PUFAs, depending upon the host cell, the availability 
of substrate, and the desired end produces), several polypeptides, particularly 
5 desaturases, are of interest including those polypeptides which catalyze the 
conversion of stearic acid to oleic acid, LA to GLA, of ALA to SDA, of oleic 
acid to LA, or of LA to ALA, which includes enzymes which desaturate at the 
A6, A9, A12, A15 or o3 positions. Considerations for choosing a specific 
polypeptide having desaturase activity include the pH optimum of the 
10 polypeptide, whether the polypeptide is a rate limiting enzyme or a component 
thereof, whether the desaturase used is essential for synthesis of a desired poly- 
unsaturated fatty acid, and/or co-factors required by the polypeptide. The 
expressed polypeptide preferably has parameters compatible with the 
biochemical environment of its location in the host cell. For example, the 
1 5 polypeptide may have to compete for substrate with other enzymes in the host 
cell. Analyses of the K m and specific activity of the polypeptide in question 
therefore are considered in determining the suitability of a given polypeptide for 
- modifying PUFA production in a given host cell. The polypeptide used in a 
particular situation therefore is one which can function under the conditions 
20 present in the intended host cell but otherwise can be any polypeptide having 
desaturase activity which has the desired characteristic of being capable of 
modifying the relative production of a desired PUFA. A scheme for the 
synthesis of arachidonic acid (20:4 A5, 8, 1 1, 14) from palmitic acid (Ci6) is 
shown in Figure 1 . A key enzyme in this pathway is a A5-desaturase which 
25 converts DH-y-linolenic acid (DGLA, eicosatrienoic acid) to ARA. Conversion 
of a-linolenic acid (ALA) to stearidonic acid by a A6-desaturase is also shown. 
Production of PUFAs in addition to ARA, including EPA and DHA is shown in 
Figure 2. A key enzyme in the synthesis of arachidonic acid (20:4 A5, 8, 1 1, 
14) from stearic acid (Cig) is a A6-desaturase which converts the linoleic acid 
30 into y-iinolenic acid. Conversion of a-linolenic acid (ALA) to stearidonic acid 
by a A6-desaturase also is shown. For production of ARA, the DNA sequence 
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used encodes a polypeptide having A5 desaturase activity. In particular 
instances, this can be coupled with an expression cassette which provides for 
production of a polypeptide having A6 desaturase activity and, optionally, a 
transcription cassette providing for production of antisense sequences to a A 15 
5 transcription product. The choice of combination of cassettes used depends in 
part on the PUFA profile of the host cell. Where the host cell A5-desaturase 
activity is limiting, overexpression of A5 desaturase alone generally will be 
sufficient to provide for enhanced ARA production. 



As sources of polypeptides having desaturase activity and 
oligonucleotides encoding such polypeptides are organisms which produce a 
desired poly-unsaturated fatty acid. As an example, microorganisms haying an 
ability to produce ARA can be used as a source of A5-desaturase genes; 

1 5 microorganisms which GLA or SDA can be used as a source of A6-desaturase 
and/or A12-desaturase genes. Such microorganisms include, for example, those 
belonging to the genera Mortierella, Conidiobolus, Pythium, Phytophathora, 
Penicillium, Porphyridium, Coidosporium, Mucor, Fusarium, Aspergillus, 
Rhodotorula, and Entomophthora. Within the genus Porphyridium, of 

20 particular interest is Porphyridium omentum. Within the genus Mortierella, of 
particular interest are Mortierella elongata, Mortierella exigua, Mortierella 
hygrophila, Mortierella ramanniana, var. angulispora, and Mortierella alpina. 
Within the genus Mucor, of particular interest are Mucor circinelloides and 
Mucor javanicus. 

25 DNAs encoding desired desaturases can be identified in a variety of 

ways. As an example, a source of the desired desaturase, for example genomic 
or cDNA libraries from Mortierella, is screened with detectable enzymatically- 
or chemically-synthesized probes, which can be made from DNA, RNA, or non- 
naturally occurring nucleotides, or mixtures thereof. Probes may be 

30 enzymatically synthesized from DNAs of known desaturases for normal or 



10 



SOURCES OF POLYPEPTIDES 
HAVING DESATURASE ACTIVITY 
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reduced-stringency hybridization methods. Oligonucleotide probes also can be 
used to screen sources and can be based on sequences of known desaturases, 
including sequences conserved among known desaturases, or on peptide 
sequences obtained from the desired purified protein. Oligonucleotide probes 

5 based on amino acid sequences can be degenerate to encompass the degeneracy 
of the genetic code, or can be biased in favor of the preferred codons of the 
source organism. Oligonucleotides also can be used as primers for PCR from 
reverse transcribed mRNA from a known or suspected source; the PCR product 
can be the full length cDNA or can be used to generate a probe to obtain the 

10 desired full length cDNA. Alternatively, a desired protein can be entirely 

sequenced and total synthesis of a DNA encoding that polypeptide performed. 

Once the desired genomic or cDNA has been isolated, it can be 
sequenced by known methods. It is recognized in the art that such methods are 
subject to errors, such that multiple sequencing of the same region is routine and 

1 5 is still expected to lead to measurable rates of mistakes in the resulting deduced 
sequence, particularly in regions having repeated domains, extensive secondary 
structure, or unusual base compositions, such as regions with high GC base 
content. When discrepancies arise, resequencing can be done and can employ 
special methods. Special methods can include altering sequencing conditions 

20 by using: different temperatures; different enzymes; proteins which alter the 
ability of oligonucleotides to form higher order structures; altered nucleotides 
such as ITP or methylated dGTP; different gel compositions, for example 
adding formamide; different primers or primers located at different distances 
from the problem region; or different templates such as single stranded DNAs. 

25 Sequencing of mRNA can also be employed. 

For the most part, some or all of the coding sequence for the polypeptide 
having desaturase activity is from a natural source. In some situations, 
however, it is desirable to modify all or a portion of the codons, for example, to 
enhance expression, by employing host preferred codons. Host preferred 
30 codons can be determined from the codons of highest frequency in the proteins 
expressed in the largest amount in a particular host species of interest. Thus, the 
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coding sequence for a polypeptide having desaturase activity can be 
synthesized in whole or in part. All or portions of the DNA also can be 
synthesized to remove any destabilizing sequences or regions of secondary 
structure which would be present in the transcribed mRNA. All or portions of 
the DNA also can be synthesized to alter the base composition to one more 
preferable in the desired host cell. Methods for synthesizing sequences and 
bringing sequences together are well established in the literature. In vitro 
mutagenesis and selection, site-directed mutagenesis, or other means can be 
employed to obtain mutations of naturally occurring desaturase genes to 
produce a polypeptide having desaturase activity in vivo with more desirable 
physical and kinetic parameters for function in the host cell, such as a longer 
half-life or a higher rate of production of a desired polyunsaturated fatty acid. 

Desirable cDNAs have less than 60% A+T composition, preferably less 
than 50% A+T composition. On a localized scale of a sliding window of 20 
base pairs, it is preferable that there are no localized regions of the cDNA with 
greater than 75% A+T composition; with a window of 60 base pairs, it is 
preferable that there are no localized regions of the cDNA with greater than 
60%, more preferably no localized regions with greater than 55% A+T 
composition. 

Mortierella alpina Desaturases 
Of particular interest are the Mortierella alpina A5-desaturase, A6- 
desaturase and A12-desaturase. The A5-desaturase has 446 amino acids; the 
amino acid sequence is shown in Figure 7. The gene encoding the Mortierella 
alpina A5-desaturase can be expressed in transgenic microorganisms to effect 
greater synthesis of ARA from DGLA. Other DNAs which are substantially 
identical in sequence to the Mortierella alpina A5-desaturase DNA, or which 
encode polypeptides which are substantially identical in sequence to the 
Mortierella alpina A5-desaturase polypeptide, also can be used. The 
Mortierella alpina A6-desaturase, has 457 amino acids and a predicted 
molecular weight of 51.8 kD; the amino acid sequence is shown in Figure 3. 
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The gene encoding the Mortierella alpina A6-desaturase can be expressed in 
transgenic plants or animals to effect greater synthesis of GLA from linoleic 
acid or of stearidonic acid (SD A) from ALA. Other DN As which are 
substantially identical in sequence to the Mortierella alpina A6-desaturase 
5 DNA, or which encode polypeptides which are substantially identical in 

sequence to the Mortierella alpina A6-desaturase polypeptide, also can be used. 

The Mortierella alpina A12-desaturase has the amino acid sequence 
shown in Figure 5. The gene encoding the Mortierella alpina A12-desaturase 
can be expressed in transgenic plants to effect greater synthesis of LA from 
10 oleic acid. Other DNAs which are substantially identical to the Mortierella 
alpina A12-desaturase DNA, or which encode polypeptides which are 
substantially identical to the Mortierella alpina A12-desaturase polypeptide, 
also can be used. 

By substantially identical in sequence is intended an amino acid 
1 5 sequence or nucleic acid sequence exhibiting in order of increasing preference 
at least 60%, 80%, 90% or 95% homology to the Mortierella alpina A5- 
desaturase amino acid sequence or nucleic acid sequence encoding the amino 
acid sequence. For polypeptides, the length of comparison sequences generally 
is at least 16 amino acids, preferably at least 20 amino acids, or most preferably 
20 35 amino acids. For nucleic acids, the length of comparison sequences 

generally is at least 50 nucleotides, preferably at least 60 nucleotides, and more 
preferably at least 75 nucleotides, and most preferably, 110 nucleotides. 
Homology typically is measured using sequence analysis software, for example, 
the Sequence Analysis software package of the Genetics Computer Group, 
25 University of Wisconsin Biotechnology Center, 1710 University Avenue, 
Madison, Wisconsin 53705, MEGAlign (DNAStar, Inc., 1228 S. Park St., 
Madison, Wisconsin 53715), and Mac Vector (Oxford Molecular Group, 2105 S. 
Bascom Avenue, Suite 200, Campbell, California 95008). Such software 
matches similar sequences by assigning degrees of homology to various 
30 substitutions, deletions, and other modifications. Conservative substitutions 

typically include substitutions within the following groups: glycine and alanine; 
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valine, isoleucine and leucine; aspartic acid, glutamic acid, asparagine, and 
glutamine; serine and threonine; lysine and arginine; and phenylalanine and 
tyrosine. Substitutions may also be made on the basis of conserved 
hydrophobicity or hydrophilicity (Kyte and Doolittle, J. Mol. Biol. 157: 105- 
5 1 32, 1 982), or on the basis of the ability to assume similar polypeptide 
secondary structure (Chou and Fasman, Adv. Enzymol. 47: 45-148, 1978). 

Other Desaturases 

Encompassed by the present invention are related desaturases from the 
same or other organisms. Such related desaturases include variants of the 

1 0 disclosed A5-, A6- and A 1 2-desaturases that occur naturally within the same or 
different species of Mortierella, as well as homologues of the disclosed A5- 
desaturase from other species and evolutionarily related protein having 
desaturase activity. Also included are desaturases which, although not 
substantially identical to the Mortierella alpina A5-desaturase, desaturate a fatty 

1 5 acid molecule at carbon 5, 6 or 1 2, respectively, from the carboxyl end of a fatty 
acid molecule. Related desaturases can be identified by their ability to function 
substantially the same as the disclosed desaturases; that is, are still able to 
effectively convert DGLA to ARA, LA to GLA, ALA to SDA or oleic acid to 
LA. Related desaturases also can be identified by screening sequence databases 

20 for sequences homologous to the disclosed desaturase, by hybridization of a 

probe based on the disclosed desaturase to a library constructed from the source 
organism, or by RT-PCR using mRNA from the source organism and primers 
based on the disclosed desaturase. Such desaturases includes those from 
humans, Dictyostelium discoideum and Phaeodactylum tricormim. 

25 The regions of a desaturase polypeptide important for desaturase activity 

can be determined through routine mutagenesis, expression of the resulting 
mutant polypeptides and determination of their activities. Mutants may include 
deletions, insertions and point mutations, or combinations thereof. A typical 
functional analysis begins with deletion mutagenesis to determine the N- and C- 

30 terminal limits of the protein necessary for function, and then internal deletions, 
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insertions or point mutants are made to further determine regions necessary for 
function. Other techniques such as cassette mutagenesis or total synthesis also 
can be used. Deletion mutagenesis is accomplished, for example, by using 
exonucleases to sequentially remove the 5* or 3' coding regions. Kits are 

5 available for such techniques. After deletion, the coding region is completed by 
ligating oligonucleotides containing start or stop codons to the deleted coding 
region after 5' or 3' deletion, respectively. Alternatively, oligonucleotides 
encoding start or stop codons are inserted into the coding region by a variety of 
methods including site-directed mutagenesis, mutagenic PCR or by ligation 

10 onto DNA digested at existing restriction sites. Internal deletions can similarly 
be made through a variety of methods including the use of existing restriction 
sites in the DNA, by use of mutagenic primers via site directed mutagenesis or 
mutagenic PCR. Insertions are made through methods such as linker-scanning 
mutagenesis, site-directed mutagenesis or mutagenic PCR. Point mutations are 

1 5 made through techniques such as site-directed mutagenesis or mutagenic PCR. 

Chemical mutagenesis can also be used for identifying regions of a 
desaturase polypeptide important for activity. A mutated construct is expressed, 
and the ability of the resulting altered protein to function as a desaturase is 
assayed. Such structure-function analysis can determine which regions may be 
20 deleted, which regions tolerate insertions, and which point mutations allow the 
mutant protein to function in substantially the same way as the native 
desaturase. All such mutant proteins and nucleotide sequences encoding them 
are within the scope of the present invention. 

EXPRESSION OF DESATURASE GENES 

25 Once the DNA encoding a desaturase polypeptide has been obtained, it 

is placed in a vector capable of replication in a host cell, or is propagated in 
vitro by means of techniques such as PCR or long PCR. Replicating vectors 
can include plasmids, phage, viruses, cosmids and the like. Desirable vectors 
include those useful for mutagenesis of the gene of interest or for expression of 

30 the gene of interest in host cells. The technique of long PCR has made in vitro 
propagation of large constructs possible, so that modifications to the gene of 
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interest, such as mutagenesis or addition of expression signals, and propagation 
of the resulting constructs can occur entirely in vitro without the use of a 
replicating vector or a host cell. 

For expression of a desaturase polypeptide, functional transcriptional 
5 and translational initiation and termination regions are operably linked to the 
DNA encoding the desaturase polypeptide. Transcriptional and translational 
initiation and termination regions are derived from a variety of nonexclusive 
sources, including the DNA to be expressed, genes known or suspected to be 
capable of expression in the desired system, expression vectors, chemical 

1 0 synthesis, or from an endogenous locus in a host cell. Expression in a plant 
tissue and/or plant part presents certain efficiencies, particularly where the 
tissue or part is one which is easily harvested, such as seed, leaves, fruits, 
flowers, roots, etc. Expression can be targeted to that location within the plant 
by using specific regulatory sequences, such as those of USPN 5,463,174, 

15 USPN 4,943,674, USPN 5,106,739, USPN 5,175,095, USPN 5,420,034, USPN 
5,188,958, and USPN 5,589,379. Alternatively, the expressed protein can be an 
enzyme which produces a product which may be incorporated, either directly or 
upon further modifications, into a fluid fraction from the host plant. In the 
present case, expression of desaturase genes, or antisense desaturase transcripts, 

20 can alter the levels of specific PUFAs, or derivatives thereof, found in plant 
parts and/or plant tissues. The A5-desaturase polypeptide coding region is 
expressed either by itself or with other genes, in order to produce tissues and/or 
plant parts containing higher proportions of desired PUFAs or in which the 
PUFA composition more closely resembles that of human breast milk (Prieto et 

25 al., PCT publication WO 95/24494). The termination region can be derived 
from the 3' region of the gene from which the initiation region was obtained or 
from a different gene. A large number of termination regions are known to and 
have been found to be satisfactory in a variety of hosts from the same and 
different genera and species. The termination region usually is selected more as 

30 a matter of convenience rather than because of any particular property. 
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The choice of a host cell is influenced in part by the desired PUFA 
profile of the transgenic cell, and the native profile of the host cell. As an 
example, for production of linoleic acid from oleic acid, the DNA sequence 
used encodes a polypeptide having A12 desaturase activity, and for production 
5 of GLA from linoleic acid, the DNA sequence used encodes a polypeptide 
having A6 desaturase activity. Use of a host cell which expresses A12 
desaturase activity and lacks or is depleted in A15 desaturase activity, can be 
used with an expression cassette which provides for overexpression of A6 
desaturase alone generally is sufficient to provide for enhanced GLA production 

10 in the transgenic cell. Where the host cell expresses A9 desaturase activity, 
expression of both a A 12- and a A6-desaturase can provide for enhanced GLA 
production. In particular instances where expression of A6 desaturase activity is 
coupled with expression of A12 desaturase activity, it is desirable that the host 
cell naturally have, or be mutated to have, low A15 desaturase activity. 

1 5 Alternatively, a host cell for A6 desaturase expression may have, or be mutated 
to have, high A12 desaturase activity. 

Expression in a host cell can be accomplished in a transient or stable 
fashion. Transient expression can occur from introduced constructs which 
contain expression signals functional in the host cell, but which constructs do 

20 not replicate and rarely integrate in the host cell, or where the host cell is not 
proliferating. Transient expression also can be accomplished by inducing the 
activity of a regulatable promoter operably linked to the gene of interest, 
although such inducible systems frequently exhibit a low basal level of 
expression. Stable expression can be achieved by introduction of a construct 

25 that can integrate into the host genome or that autonomously replicates in the 
host cell. Stable expression of the gene of interest can be selected for through 
the use of a selectable marker located on or transfected with the expression 
construct, followed by selection for cells expressing the marker. When stable 
expression results from integration, integration of constructs can occur 

30 randomly within the host genome or can be targeted through the use of 

constructs containing regions of homology with the host genome sufficient to 
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target recombination with the host locus. Where constructs are targeted to an 
endogenous locus, all or some of the transcriptional and translation^ regulatory 
regions can be provided by the endogenous locus. 

When increased expression of the desaturase polypeptide in the source 
5 plant is desired, several methods can be employed. Additional genes encoding 
the desaturase polypeptide can be introduced into the host organism. 
Expression from the native desaturase locus also can be increased through 
homologous recombination, for example by inserting a stronger promoter into 
the host genome to cause increased expression, by removing destabilizing 
1 0 sequences from either the mRNA or the encoded protein by deleting that 
information from the host genome, or by adding stabilizing sequences to the 
mRNA (see USPN 4,910,141 and USPN 5,500,365.) 

When it is desirable to express more than one different gene, appropriate 
regulatory regions and expression methods, introduced genes can be propagated 

15 in the host cell through use of replicating vectors or by integration into the host 
genome. Where two or more genes are expressed from separate replicating 
vectors, it is desirable that each vector has a different means of replication. 
Each introduced construct, whether integrated or not, should have a different 
means of selection and should lack homology to the other constructs to maintain 

20 stable expression and prevent reassortment of elements among constructs. 
Judicious choices of regulatory regions, selection means and method of 
propagation of the introduced construct can be experimentally determined so 
that all introduced genes are expressed at the necessary levels to provide for 
synthesis of the desired products. 

25 Constructs comprising the gene of interest may be introduced into a host 

cell by standard techniques. These techniques include transfection, infection, 
holistic impact, electroporation, microinjection, scraping, or any other method 
which introduces the gene of interest into the host cell (see USPN 4,743,548, 
USPN 4,795,855, USPN 5,068,193, USPN 5,188,958, USPN 5,463,174, USPN 

30 5,565,346 and USPN 5,565,347). For convenience, a host cell which has been 
manipulated by any method to take up a DNA sequence or construct will be 
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referred to as "transformed" or "recombinant" herein. The subject host will 
have at least have one copy of the expression construct and may have two or 
more, depending upon whether the gene is integrated into the genome, 
amplified, or is present on an extrachromosomal element having multiple copy 
numbers. 

The transformed host cell can be identified by selection for a marker 
contained on the introduced construct. Alternatively, a separate marker 
construct may be introduced with the desired construct, as many transformation 
techniques introduce many DNA molecules into host cells. Typically, 
transformed hosts are selected for their ability to grow on selective media. 
Selective media may incorporate an antibiotic or lack a factor necessary for 
growth of the untransformed host, such as a nutrient or growth factor. An 
introduced marker gene therefor may confer antibiotic resistance, or encode an 
essential growth factor or enzyme, and permit growth on selective media when 
expressed in the transformed host cell. Desirably, resistance to kanamycin and 
the amino glycoside G418 are of interest (see USPN 5,034,322). Selection of a 
transformed host can also occur when the expressed marker protein can be 
detected, either directly or indirectly. The marker protein may be expressed 
alone or as a fusion to another protein. The marker protein can be detected by 
its enzymatic activity; for example (J galactosidase can convert the substrate X- 
gal to a colored product, and luciferase can convert luciferin to a light-emitting 
product. The marker protein can be detected by its light-producing or 
modifying characteristics; for example, the green fluorescent protein of 
Aequorea victoria fluoresces when illuminated with blue light. Antibodies can 
be used to detect the marker protein or a molecular tag on, for example, a 
protein of interest. Cells expressing the marker protein or tag can be selected, 
for example, visually, or by techniques such as FACS or panning using 
antibodies. 

The PUFAs produced using the subject methods and compositions may 
be found in the host plant tissue and/or plant part as free fatty acids or in 
conjugated forms such as acylglycerols, phospholipids, sulfolipids or 
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glycolipids, and may be extracted from the host cell through a variety of means 
well-known in the art. Such means may include extraction with organic 
solvents, sonication, supercritical fluid extraction using for example carbon 
dioxide, and physical means such as presses, or combinations thereof. Of 
5 particular interest is extraction with hexane or methanol and chloroform. Where 
desirable, the aqueous layer can be acidified to protonate negatively charged 
moieties and thereby increase partitioning of desired products into the organic 
layer. After extraction, the organic solvents can be removed by evaporation 
under a stream of nitrogen. When isolated in conjugated forms, the products are 
1 0 enzymatically or chemically cleaved to release the free fatty acid or a less 

complex conjugate of interest, and are then subjected to further manipulations to 
produce a desired end product. Desirably, conjugated forms of fatty acids are 
cleaved with potassium hydroxide. 

PURIFICATION OF FATTY ACIDS 

1 5 If further purification is necessary, standard methods can be employed. 

Such methods include extraction, treatment with urea, fractional crystallization, 
HPLC, fractional distillation, silica gel chromatography, high speed 
centrifugation or distillation, or combinations of these techniques. Protection of 
reactive groups, such as the acid or alkenyl groups, may be done at any step 

20 through known techniques, for example alkylation or iodination. Methods used 
include methylation of the fatty acids to produce methyl esters. Similarly, 
protecting groups may be removed at any step. Desirably, purification of 
fractions containing ARA, DHA and EPA is accomplished by treatment with 
urea and/or fractional distillation. 

- 5 USES OF FATTY ACIDS 

The uses of the fatty acids of subject invention are several. Probes based 
on the DNAs of the present invention may find use in methods for isolating 
related molecules or in methods to detect organisms expressing desaturases. 
When used as probes, the DNAs or oligonucleotides need to be detectable. This 
50 is usually accomplished by attaching a label either at an internal site, for 
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example via incorporation of a modified residue, or at the 5' or 3* terminus. 
Such labels can be directly detectable, can bind to a secondary molecule that is 
detectably labeled, or can bind to an unlabelled secondary molecule and a 
detectably labeled tertiary molecule; this process can be extended as long as is 
5 practical to achieve a satisfactorily detectable signal without unacceptable levels 
of background signal. Secondary, tertiary, or bridging systems can include use 
of antibodies directed against any other molecule, including labels or other 
antibodies, or can involve any molecules which bind to each other, for example 
a biotin-streptavidin/avidin system. Detectable labels typically include 

10 radioactive isotopes, molecules which chemically or enzymatically produce or 
alter light, enzymes which produce detectable reaction products, magnetic 
molecules, fluorescent molecules or molecules whose fluorescence or light- 
emitting characteristics change upon binding. Examples of labelling methods 
can be found in USPN 5,01 1 ,770. Alternatively, the binding of target molecules 

1 5 can be directly detected by measuring the change in heat of solution on binding 
of probe to target via isothermal titration calorimetry, or by coating the probe or 
target on a surface and detecting the change in scattering of light from the 
surface produced by binding of target or probe, respectively, as may be done 
with the BIAcore system. 

20 PUFAs of the subject invention produced by recombinant means find 

applications in a wide variety of areas. Supplementation of humans or animals 
with PUFAs in various forms can result in increased levels not only of the 
added PUFAs, but of their metabolic progeny as well. For example, where the 
inherent A6-desaturase pathway is dysfunctional in an individual, treatment with 

25 GLA can result not only in increased levels of GLA, but also of downstream 
products such as ARA and prostaglandins (see Figure 1). Complex regulatory 
mechanisms can make it desirable to combine various PUFAs, or to add 
different conjugates of PUFAs, in order to prevent, control or overcome such 
mechanisms to achieve the desired levels of specific PUFAs in an individual. 

30 PUFAs, or derivatives thereof, made by the disclosed method can be 

used as dietary supplements, particularly in infant formulas, for patients 
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undergoing intravenous feeding or for preventing or treating malnutrition. 
Particular fatty acids such as EPA are used to alter the composition of infant 
formulas to better replicate the PUFA composition of human breast milk. The 
predominant triglyceride in human milk has been reported to be l,3-di-oleoyl-2- 
palmitoyl, with 2-palmitoyl glycerides reported as better absorbed than 2-oleoyl 
or 2-lineoyl glycerides (USPN 4,876,107). Typically, human breast milk has a 
fatty acid profile comprising from about 0.15 % to about 0.36 % as DHA, from 
about 0.03 % to about 0.13 % as EPA, from about 0.30 % to about 0.88 % as 
ARA, from about 0.22 % to about 0.67 % as DGLA, and from about 0.27 % to 
about 1.04 % as GLA. A preferred ratio of GLA:DGLA:ARA in infant 
formulas is from about 1 : 1 :4 to about 1:1:1, respectively. Amounts of oils 
providing these ratios of PUFA can be determined without undue 
experimentation by one of skill in the art. PUFAs, or host cells containing 
them, also can be used as animal food supplements to alter an animal's tissue or 
1 5 milk fatty acid composition to one more desirable for human or animal 
consumption. 

NUTRITIONAL COMPOSITIONS 

The present invention also includes nutritional compositions. Such 
compositions, for purposes of the present invention, include any food or 
preparation for human consumption including for enteral or parenteral 
consumption, which when taken into the body (a) serve to nourish or build up 
tissues or supply energy and/or (b) maintain, restore or support adequate 
nutritional status or metabolic function. 

The nutritional composition of the present invention comprises at least 
one oil or acid produced in accordance with the present invention and may 
either be in a solid or liquid form. Additionally, the composition may include 
edible macronutrients, vitamins and minerals in amounts desired for a particular 
use. The amount of such ingredients will vary depending on whether the 
composition is intended for use with normal, healthy infants, children or adults 
having specialized needs such as those which accompany certain metabolic 
conditions (e.g., metabolic disorders). 



20 



25 
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Examples of macronutrients which may be added to the composition 
include but are not limited to edible fats, carbohydrates and proteins. Examples 
of such edible fats include but are not limited to coconut oil, soy oil, and mono- 
and diglycerides. Examples of such carbohydrates include but are not limited to 
5 glucose, edible lactose and hydrolyzed search. Additionally, examples of 
proteins which may be utilized in the nutritional composition of the invention 
include but are not limited to soy proteins, electrodialysed whey , 
electrodialysed skim milk, milk whey, or the hydrolysates of these proteins. 

With respect to vitamins and minerals, the following may be added to 
10 the nutritional compositions of the present invention: calcium, phosphorus, 
potassium, sodium, chloride, magnesium, manganese, iron, copper, zinc, 
selenium, iodine, and Vitamins A, E, D, C, and the B complex. Other such 
vitamins and minerals may also be added. 

The components utilized in the nutritional compositions of the present 
15 invention will of semi-purified or purified origin. By semi-purified or purified 
is meant a material which has been prepared by purification of a natural 
material or by synthesis. 

Examples of nutritional compositions of the present invention include 
but are not limited to infant formulas, dietary supplements, and rehydration 
20 compositions. Nutritional compositions of particular interest include but are not 
limited to those utilized for enteral and parenteral supplementation for infants, 
specialist infant formulae, supplements for the elderly, and supplements for 
those with gastrointestinal difficulties and/or malabsorption. 

Nutritional Compositions 

25 A typical nutritional composition of the present invention will contain 

edible macronutrients, vitamins and minerals in amounts desired for a particular 
use. The amounts of such ingredients will vary depending on whether the 
formulation is intended for use with normal, healthy individuals temporarily 
exposed to stress, or to subjects having specialized needs due to certain chronic 

30 or acute disease states (e.g., metabolic disorders). It will be understood by 
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persons skilled in the art that the components utilized in a nutritional 
formulation of the present invention are of semi-purified or purified origin. By 
semi-purified or purified is meant a material that has been prepared by 
purification of a natural material or by synthesis. These techniques are well 
5 known in the art (See, e.g., Code of Federal Regulations for Food Ingredients 
and Food Processing; Recommended Dietary Allowances, 10 th Ed., National 
Academy Press, Washington, D.C., 1989). 

In a preferred embodiment, a nutritional formulation of the present 
invention is an enteral nutritional product, more preferably an adult or child 
10 enteral nutritional product. Accordingly in a further aspect of the invention, a 
nutritional formulation is provided that is suitable for feeding adults or children 
who are experiencing stress. The formula comprises, in addition to the PUFAs 
of the invention; macronutrients, vitamins and minerals in amounts designed to 
provide the daily nutritional requirements of adults. 

15 The macronutritional components include edible fats, carbohydrates and 

proteins. Exemplary edible fats are coconut oil, soy oil, and mono- and 
diglycerides and the PUFA oils of this invention. Exemplary carbohydrates are 
glucose, edible lactose and hydrolyzed cornstarch. A typical protein source 
would be soy protein, electfodialysed whey or electrodialysed skim milk or milk 

20 whey, or the hydrolysates of these proteins, although other protein sources are 
also available and may be used. These macronutrients would be added in the 
form of commonly accepted nutritional compounds in amount equivalent to 
those present in human milk or an energy basis, i.e., on a per calorie basis. 

Methods for formulating liquid and enteral nutritional formulas are well 
25 known in the art and are described in detail in the examples. 

The enteral formula can be sterilized and subsequently utilized on a 

ready-to-feed (RTF) basis or stored in a concentrated liquid or a powder. The 

powder can be prepared by spray drying the enteral formula prepared as 

indicated above, and the formula can be reconstituted by rehydrating the 

30 concentrate. Adult and infant nutritional formulas are well known in the art and 

commercially available (e.g., Similac®, Ensure®, Jevity® and Alimentum® 
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from Ross Products Division, Abbott Laboratories). An oil or acid of the 
present invention can be added to any of these formulas in the amounts 
described below. 

The energy density of the nutritional composition when in liquid form, 
can typically range from about 0.6 Kcal to 3 Kcal per ml. When in solid or 
powdered form, the nutritional supplement can contain from about 1.2 to more 
than 9 Kcals per gm, preferably 3 to 7 Kcals per gm. In general, the osmolality 
of a liquid product should be less than 700 mOsm and more preferably less than 
660 mOsm. 



The nutritional formula would typically include vitamins and minerals, 
in addition to the PUFAs of the invention, in order to help the individual ingest 
the minimum daily requirements for these substances. In addition to the PUFAs 
listed above, it may also be desirable to supplement the nutritional composition 
with zinc, copper, and folic acid in addition to antioxidants. It is believed that 
1 5 these substances will also provide a boost to the stressed immune system and 
thus will provide further benefits to the individual. The presence of zinc, 
copper or folic acid is optional and is not required in order to gain the beneficial 
effects on immune suppression. Likewise a pharmaceutical composition can be 
supplemented with these same substances as well. 

In a more preferred embodiment, the nutritional contains, in addition to 
the antioxidant system and the PUFA component, a source of carbohydrate 
wherein at least 5 weight % of said carbohydrate is an indigestible 
oligosaccharide. In yet a more preferred embodiment, the nutritional 
composition additionally contains protein, taurine and carnitine. 

The PUFAs, or derivatives thereof, made by the disclosed method can 
be used as dietary substitutes, or supplements, particularly infant formulas, for 
patients undergoing intravenous feeding or for preventing or treating 
malnutrition. Typically, human breast milk has a fatty acid profile comprising 
from about 0.15 % to about 0.36 % as DHA, from about 0.03 % to about 0.13 % 
30 as EPA, from about 0.30 % to about 0.88 % as ARA, from about 0.22 % to 
about 0.67 % as DGL A, and from about 0.27 % to about 1 .04 % as GLA. 
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Additionally, the predominant triglyceride in human milk has been reported to 
be l,3-di-oleoyl-2-palmitoyl, with 2-palmitoyl glycerides reported as better 
absorbed than 2-oleoyl or 2-lineoyl glycerides (USPN 4,876,107). Thus, fatty 
acids such as ARA, DGLA, GLA and/or EPA produced by the invention can be 
5 used to alter the composition of infant formulas to better replicate the PUFA 
composition of human breast milk. In particular, an oil composition for use in a 
pharmacologic or food supplement, particularly a breast milk substitute or 
supplement, will preferably comprise one or more of ARA, DGLA and GLA. 
More preferably the oil will comprise from about 0.3 to 30% ARA, from about 
1 0 0.2 to 30% DGLA, and from about 0.2 to about 30% GLA. 

In addition to the concentration, the ratios of ARA, DGLA and GLA can 
be adapted for a particular given end use. When formulated as a breast milk 
supplement or substitute, an oil composition which contains two or more of 
ARA, DGLA and GLA will be provided in a ratio of about 1 : 19:30 to about 
6: 1 :0.2, respectively. For example, the breast milk of animals can vary in ratios 
of ARA:DGLA:DGL ranging from 1:19:30 to 6:1:0.2, which includes 
intermediate ratios which are preferably about 1:1:1, 1:2:1, 1:1:4. When 
produced together in a host cell, adjusting the rate and percent of conversion of 
a precursor substrate such as GLA and DGLA to ARA can be used to precisely 
20 control the PUFA ratios. For example, a 5% to 10% conversion rate of DGLA 
to ARA can be used to produce an ARA to DGLA ratio of about 1:19, whereas 
a conversion rate of about 75% to 80% can be used to produce an ARA to 
DGLA ratio of about 6: 1 . Therefore, whether in a cell culture system or in a 
host animal, regulating the timing, extent and specificity of desaturase 
25 expression as described can be used to modulate the PUFA levels and ratios. 
Depending on the expression system used, e.g., cell culture or an animal 
expressing oil(s) in its milk, the oils also can be isolated and recombined in the 
desired concentrations and ratios. Amounts of oils providing these ratios of 
PUFA can be determined following standard protocols. PUFAs, or host cells 
containing them, also can be used as animal food supplements to alter an 
animal's tissue or milk fatty acid composition to one more desirable for human 
or animal consumption. 
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For dietary supplementation, the purified PUFAs, or derivatives thereof, 
may be incorporated into cooking oils, fats or margarines formulated so that in 
normal use the recipient would receive the desired amount. The PUFAs may 
also be incorporated into infant formulas, nutritional supplements or other food 
5 products, and may find use as anti-inflammatory or cholesterol lowering agents. 



The present invention also encompasses a pharmaceutical composition 
comprising one or more of the acids and/or resulting oils produced in 
accordance with the methods described herein. More specifically, such a 

10 pharmaceutical composition may comprise one or more of the acids and/or oils 
as well as a standard, well-known, non-toxic pharmaceutically acceptable 
carrier, adjuvant or vehicle such as, for example, phosphate buffered saline, 
water, ethanol, polyols, vegetable oils, a wetting agent or an emulsion such as a 
water/oil emulsion. The composition may be in either a liquid or solid form. 

1 5 For example, the composition may be in the form of a tablet, capsule, ingestible 
liquid or powder, injectible, or topical ointment or cream. 

Possible routes of administration include, for example, oral, rectal and 
parenteral. The route of administration will, of course, depend upon the desired 
effect. For example, if thecomposition is being utilized to treat rough, dry, or 
20 aging skin, to treat injured or burned skin, or to treat skin or hair affected by a 
disease or condition, it may perhaps be applied topically. 

The dosage of the composition to be administered to the patient may be 
determined by one of ordinary skill in the art and depends upon various factors 
such as weight of the patient, age of the patient, immune status of the patient, 



With respect to form, the composition may be, for example, a solution, a 
dispersion, a suspension, an emulsion or a sterile powder which is then 
reconstituted. 



Pharmaceutical Compositions 



25 



etc. 
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Additionally, the composition of the present invention may be utilized 
for cosmetic purposes. It may be added to pre-existing cosmetic compositions 
such that a mixture is formed or may be used as a sole composition. 

Pharmaceutical compositions may be utilized to administer the PUFA 
5 component to an individual. Suitable pharmaceutical compositions may 

comprise physiologically acceptable sterile aqueous or non-aqueous solutions, 
dispersions, suspensions or emulsions and sterile powders for reconstitution into 
sterile solutions or dispersions for ingestion. Examples of suitable aqueous and 
non-aqueous carriers, diluents, solvents or vehicles include water, ethanol, 

1 0 polyols (propyleneglycol, polyethyleneglycol, glycerol, and the like), suitable 
mixtures thereof, vegetable oils (such as olive oil) and injectable organic esters 
such as ethyl oleate. Proper fluidity can be maintained, for example, by the 
maintenance of the required particle size in the case of dispersions and by the 
use of surfactants. It may also be desirable to include isotonic agents, for 

15 example sugars, sodium chloride and the like. Besides such inert diluents, the 
composition can also include adjuvants, such as wetting agents, emulsifying and 
suspending agents, sweetening, flavoring and perfuming agents. 

Suspensions, in addition to the active compounds, may contain 
suspending agents, as for example, ethoxylated isostearyl alcohols, 
20 polyoxyethylene sorbitol and sorbitan esters, microcrystalline cellulose, 

aluminum metahydroxide, bentonite, agar-agar and tragacanth or mixtures of 
these substances, and the like. 

Solid dosage forms such as tablets and capsules can be prepared using 
techniques well known in the art. For example, PUFAs of the invention can be 

25 tableted with conventional tablet bases such as lactose, sucrose, and cornstarch 
in combination with binders such as acacia, cornstarch or gelatin, disintegrating 
agents such as potato starch or alginic acid and a lubricant such as stearic acid 
or magnesium stearate. Capsules can be prepared by incorporating these 
excipients into a gelatin capsule along with the antioxidants and the PUFA 

30 component. The amount of the antioxidants and PUFA component that should 
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be incorporated into the pharmaceutical formulation should fit within the 
guidelines discussed above. 

As used in this application, the term "treat 11 refers to either preventing, or 
reducing the incidence of, the undesired occurrence. For example, to treat 

5 immune suppression refers to either preventing the occurrence of this 

suppression or reducing the amount of such suppression. The terms "patient" 
and "individual" are being used interchangeably and both refer to an animal. 
The term "animal" as used in this application refers to any warm-blooded 
mammal including, but not limited to, dogs, humans, monkeys, and apes. As 

10 used in the application the term "about" refers to an amount varying from the 
stated range or number by a reasonable amount depending upon the context of 
use. Any numerical number or range specified in the specification should be 
considered to be modified by the term about. 

"Dose" and "serving" are used interchangeably and refer to the amount 
15 of the nutritional or pharmaceutical composition ingested by the patient in a 
single setting and designed to deliver effective amounts of the antioxidants and 
the structured triglyceride. As will be readily apparent to those skilled in the 
art, a single dose or serving of the liquid nutritional powder should supply the 
amount of antioxidants and PUFAs discussed above. The amount of the dose or 
20 serving should be a volume that a typical adult can consume in one sitting. This 
amount can vary widely depending upon the age, weight, sex or medical 
condition of the patient. However as a general guideline, a single serving or 
dose of a liquid nutritional produce should be considered as encompassing a 
volume from 100 to 600 ml, more preferably from 125 to 500 ml and most 
25 preferably from 125 to 300 ml. 

The PUFAs of the present invention may also be added to food even 
when supplementation of the diet is not required. For example, the composition 
may be added to food of any type including but not limited to margarines, 
modified butters, cheeses, milk, yogurt, chocolate, candy, snacks, salad oils, 
30 cooking oils, cooking fats, meats, fish and beverages. 
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Pharmaceutical Applications 

For pharmaceutical use (human or veterinary), the compositions are 
generally administered orally but can be administered by any route by which 
they may be successfully absorbed, e.g., parenterally (i.e. subcutaneously, 
5 intramuscularly or intravenously), rectally or vaginally or topically, for 

example, as a skin ointment or lotion. The PUFAs of the present invention may 
be administered alone or in combination with a pharmaceutically acceptable 
carrier or excipient. Where available, gelatin capsules are the preferred form of 
oral administration. Dietary supplementation as set forth above also can 

1 0 provide an oral route of administration. The unsaturated acids of the present 
invention may be administered in conjugated forms, or as salts, esters, amides 
or prodrugs of the fatty acids. Any pharmaceutically acceptable salt is 
encompassed by the present invention; especially preferred are the sodium, 
potassium or lithium salts. Also encompassed are the N-alkylpolyhydrdxamine 

1 5 salts, such as N-methyl glucamine, found in PCT publication WO 96/33 155. 
The preferred esters are the ethyl esters. As solid salts, the PUFAs also can be 
administered in tablet form. For intravenous administration, the PUFAs or 
derivatives thereof may be incorporated into commercial formulations such as 
Intralipids. The typical normal adult plasma fatty acid profile comprises 6.64 to 

20 9.46% of ARA, 1 .45 to 3. 1 1 % of DGLA, and 0.02 to 0.08% of GLA. These 
PUFAs or their metabolic precursors can be administered, either alone or in 
mixtures with other PUFAs, to achieve a normal fatty acid profile in a patient. 
Where desired, the individual components of formulations may be individually 
provided in kit form, for single or multiple use. A typical dosage of a particular 

25 fatty acid is from 0. 1 mg to 20 g, or even 1 00 g daily, and is preferably from 1 0 
mg to 1, 2, 5 or 10 g daily as required, or molar equivalent amounts of 
derivative forms thereof. Parenteral nutrition compositions comprising from 
about 2 to about 30 weight percent fatty acids calculated as triglycerides are 
encompassed by the present invention; preferred is a composition having from 

30 about 1 to about 25 weight percent of the total PUFA composition as GLA 

(USPN 5,196,198). Other vitamins, and particularly fat-soluble vitamins such 

as vitamin A, D, E and L-carnitine can optionally be included. Where desired a 
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preservative such as a tocopherol may be added, typically at about 0.1% by 
weight. 

Suitable pharmaceutical compositions may comprise physiologically 
acceptable sterile aqueous or non-aqueous solutions, dispersions, suspensions or 

5 emulsions and sterile powders for reconstitution into sterile injectible solutions 
or dispersions. Examples of suitable aqueous and non-aqeuous carriers, 
diluents, solvents or vehicles include water, ethanol, polyols (propylleneglyol, 
polyethylenegycol, glycerol, and the like), suitable mixtures thereof, vegetable 
oils (such as olive oil) and injectable organic esters such as ehyl oleate. Proper 

10 fluidity can be maintained, for example, by the maintenance of the required 

particle size in the case of dispersions and by the use of surfactants. It may also 
be desirable to include isotonic agents, for example sugars, sodium chloride and 
the like. Besides such inert diluents, the composition can also include 
adjuvants, such as wetting agents, emulsifying and suspending agents, 

1 5 sweetening, flavoring and perfuming agents. 

Suspensions in addition to the active compounds, may contain 
suspending agents, as for example, ethoxylated isostearyl alcohols, 
polyoxyethylene sorbitol and sorbitan esters, microcrystalline cellulose, 
aluminum metahydroxide, bentonite, agar-agar and tragacanth, or mixtures of 
20 these substances and the like. 

An especially preferred pharmaceutical composition contains 
diacetyltartaric acid esters of mono- and diglycerides dissolved in an aqueous 
medium or solvent. Diacetyltartaric acid esters of mono- and diglycerides have 
an HLB value of about 9-12 and are significantly more hydrophilic than existing 

25 antimicrobial lipids that have HLB values of 2-4. Those existing hydrophobic 
lipids cannot be formulated into aqueous compositions. As disclosed herein, 
those lipids can now be solubilized into aqueous media in combination with 
diacetyltartaric acid esters of mono-and diglycerides. In accordance with this 
embodiment, diacetyltartaric acid esters of mono- and diglycerides (e.g., 

30 DATEM-C12:0) is melted with other active antimicrobial lipids (e.g., 18:2 and 
12:0 monoglycerides) and mixed to obtain a homogeneous mixture. 
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Homogeneity allows for increased antimicrobial activity. The mixture can be 
completely dispersed in water This is not possible without the addition of 
diacetyitartaric acid esters of mono- and diglycerides and premixing with other 
monoglycerides prior to introduction into water. The aqueous composition can 
5 then be admixed under sterile conditions with physiologically acceptable 

diluents, preservatives, buffers or propellants as may be required to form a spray 
or inhalant. 

The present invention also encompasses the treatment of numerous 
disorders with fatty acids. Supplementation with PUFAs of the present 

10 invention can be used to treat restenosis after angioplasty. Symptoms of 

inflammation, rheumatoid arthritis, and asthma and psoriasis can be treated with 
the PUFAs of the present invention. Evidence indicates that PUFAs may be 
involved in calcium metabolism, suggesting that PUFAs of the present 
invention may be used in the treatment or prevention of osteoporosis and of 

1 5 kidney or urinary tract stones. 

The PUFAs of the present invention can be used in the treatment of 
cancer. Malignant cells have been shown to have altered fatty acid 
compositions; addition of fatty acids has been shown to slow their growth and 
cause cell death, and to increase their susceptibility to chemotherapeutic agents. 

20 GLA has been shown to cause reexpression on cancer cells of the E-cadherin 
cellular adhesion molecules, loss of which is associated with aggressive 
metastasis. Clinical testing of intravenous administration of the water soluble 
lithium salt of GLA to pancreatic cancer patients produced statistically 
significant increases in their survival. PUFA supplementation may also be 

25 useful for treating cachexia associated with cancer. 

The PUFAs of the present invention can also be used to treat diabetes 
(USPN 4,826,877; Horrobin et al, Am. J. Clin. Nutr. Vol. 57 (Suppl.), 732S- 
737S). Altered fatty acid metabolism and composition has been demonstrated 
in diabetic animals. These alterations have been suggested to be involved in 
30 some of the long-term complications resulting from diabetes, including 
retinopathy, neuropathy, nephropathy and reproductive system damage. 
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Primrose oil, which contains GLA, has been shown to prevent and reverse 
diabetic nerve damage. 

The PUFAs of the present invention can be used to treat eczema, reduce 
blood pressure and improve math scores. Essential fatty acid deficiency has 

5 been suggested as being involved in eczema, and studies have shown beneficial 
effects on eczema from treatment with GLA. GLA has also been shown to 
reduce increases in blood pressure associated with stress, and to improve 
performance on arithmetic tests. GLA and DGLA have been shown to inhibit 
platelet aggregation, cause vasodilation, lower cholesterol levels and inhibit 

1 0 proliferation of vessel wall smooth muscle and fibrous tissue (Brenner et al. , 
Adv. Exp. Med. Biol. Vol. 83, p. 85-101, 1976). Administration of GLA or 
DGLA, alone or in combination with EPA, has been shown to reduce or prevent 
gastro-intestinal bleeding and other side effects caused by non-steroidal anti- 
inflammatory drugs (USPN 4,666,701). GLA and DGLA have also been shown 

15 to prevent or treat endometriosis and premenstrual syndrome (USPN 4,758,592) 
and to treat myalgic encephalomyelitis and chronic fatigue after viral infections 
(USPN 5,116,871). 

Further uses of the PUFAs of this invention include use in treatment of 
AIDS, multiple schlerosis, acute respiratory syndrome, hypertension and 
20 inflammatory skin disorders. The PUFAs of the inventions also can be used for 
formulas for general health as well as for geriatric treatments. 

Veterinary Applications 

It should be noted that the above-described pharmaceutical and 
nutritional compositions may be utilized in connection with animals, as well as 
25 humans, as animals experience many of the same needs and conditions as 

human. For example, the oil or acids of the present invention may be utilized in 
animal feed supplements or as animal feed substitutes. 

The following examples are presented by way of illustration, not of 
limitation. 
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Examples 

Example 1 Isolation of A5 Desaturase Nucleotide Sequence from 
Mortierella alpina 

Example 2 Isolation of A6 Desaturase Nucleotide Sequence from 
Mortierella alpina 

Example 3 Identification of A6 Desaturases Homologues to the 
Mortierella alpina A Desaturase 

Example 4 Isolation of D-l 2 Desaturase Nucleotide Sequence from 
Mortierella alpina 

Example 5 Isolation of Cytochrome b5 Reductase Nucleotide 
Sequence from Mortierella alpina 

Example 6 Expression of M. alpina Desaturase Clones in Baker's 
Yeast 

Example 7 Fatty Acid Analysis of Leaves from Ma29 Transgenic 
Brassica Plants 

Example 8 Expression of M. alpina A6 Desaturase in Brassica 
napus 



20 



25 



Example 9 Expression of M. alpina A12 desaturase in Brassica 
napus 

Example 10 Simultaneous expression of M alpina A6 and A 12 
desaturases in Brassica napus 

Example 1 1 Simultaneous expression of M alpina A5 and A6 
desaturases in Brassica napus 

Example 12 Simultaneous expression of M. alpina A5, A6 and A12 
desaturases in Brassica napus 

Example 1 3 Stereospecific Distribution of A6-Desaturated Oils 

Example 14 Fatty Acid Compositions of Transgenic Plants 
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Example 1 5 Combined Expression of A6 and A12 Desaturases in B. 
napus Achieved by Crossing 

Example 16 Expression of M alpina desaturases in soybean 

Example 17 Human Desaturase Gene Sequences 

5 Example 1 

Isolation of a A5-desaturase Nucleotide Sequence from Mortierella alpina 
Motierella alpina produces arachidonic acid (ARA, 20:4) from the 
precursor 20:3 by a A5-desaturase. A nucleotide sequence encoding the A5- 
desaturase from Mortierella alpina (see Figure 7) was obtained through PCR 
10 amplification using M. alpina 1 st strand cDNA and degenerate oligonucleotide 
primers corresponding to amino acid sequences conserved between A6- 
desaturases from Synechocystis and Spirulina. The procedure used was as 
follows: 

Total RN A was isolated from a 3 day old PUFA-producing culture of 
1 5 Mortierella alpina using the protocol of Hoge et al (1 982) Experimental 

Mycology 6:225-232. The RNA was used to prepare double-stranded cDNA 
using BRL's lambda-ZipLox system, following the manufacturer's instructions. 
Several size fractions of the M alpina cDNA were packaged separately to yield 
libraries with different average-sized inserts. The "full-length" library contains 
20 approximately 3 x 10 6 clones with an average insert size of 1.77 kb. The 
"sequencing-grade" library contains approximately 6 x 10 5 clones with an 
average insert size of 1.1 kb. 

5 jig of total RNA was reverse transcribed using BRL Superscript RTase 
and the primer TSyn 5 ' -C AAGCTTCTGC AGG AGCTCTTTTTTTTTTTTTT^ 
25 3' (SEQ ID NO:19.) Degenerate oligonucleotides were designed to regions 
conserved between the two cyanobacterial A6-desaturase sequences. The 
specific primers used were: 
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D6DESAT-F3 (SEQ ID NO:20) 

5'-CUACUACUACUACAYCAYACOTAYACOAAYAT-3' 

D6DESAT-R3 ( SEQ ID NO:21) 

5 ' -C AUC AUC AUC AUOGGRAAO ARRTGRTG-3 ' 

5 where Y=C+T, R=A+G, and 0=I+C. PCR amplification was carried out in a 
25^ volume containing: template derived from 40 ng total RNA, 2 pM each 
primer, 200 |iM each deoxyribonucleotide triphosphate, 60 mM Tris-Cl, pH 8.5, 
1 5 mM (NH4)2S0 4 , 2 mM MgCb. Samples were subjected to an initial 
desaturation step of 95 degrees (all temperatures Celsius) for 5 minutes, then 

1 0 held at 72 degrees while 0.2 U of Taq polymerase were added. PCR 

thermocycling conditions were as follows: 94 degrees for 1 min., 45 degrees 
for 1 .5 min., 72 degrees for 2 min. PCR was continued for 35 cycles. PCR 
using these primers on the A£ alpina first-strand cDNA produced a 550 bp 
reaction product. Comparison of the deduced amino acid sequence of the M 

1 5 alpina PCR fragment revealed regions of homology with A6-desaturases {see 
Figure 4). However, there was only about 28% identity over the region 
compared. The deduced amino acid sequence is presented in SEQ ID NO: 14. 

The PCR product was used as a probe to isolate corresponding cDNA 
clones from a M. alpina library. The longest cDNA clone, Ma29, was 

20 designated pCGN5521 and has been completely sequenced on both strands. 
The cDNA is contained as a 1481 bp insert in the vector pZLl (Bethesda 
Research Laboratories) and, beginning with the first ATG, contains an open 
reading frame encoding 446 amino acids. The reading frame contains the 
sequence deduced from the PCR fragment. The sequence of the cDNA insert 

25 was found to contain regions of homology to A6-desaturases (see Figure 8). For 
example, three conserved "histidine boxes" (that have been observed in other 
membrane-bound desaturases (Okuley et al, (1994) The Plant Cell 5:147-158)) 
were found to be present in the Mortierella sequence at amino acid positions 
171-175, 207-212, and 387-391 (see Figure 5A-5D). However, the typical 

30 "HXXHH" amino acid motif for the third histidine box for the Mortierella 
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desaturase was found to be QXXHH. The amino-terminus of the encoded 
protein, showed significant homology to cytochrome b5 proteins. Thus, the 
Mortierella cDNA clone appears to represent a fusion between a cytochrome b5 
and a fatty acid desaturase. Since cytochrome b5 is believed to function as the 
5 electron donor for membrane-bound desaturase enzymes, it is possible that the 
N-terminal cytochrome b5 domain of this desaturase protein is involved in its 
function. This may be advantageous when expressing the desaturase in 
heterologous systems for PUFA production. 

Example 2 

10 Isolation of A6 Desat urase Nucleotide Sequence from Mortierella giving 
A nucleic acid sequence from a partial cDNA clone, Ma524, encoding a 
A6 fatty acid desaturase from Mortierella alpina was obtained by random 
sequencing of clones from the M. alpina cDNA library described in Example 1. 
cDNA-containing plasmids were excised as follows: 

15 Five jil of phage were combined with 100 \x\ ofE. coli DHIOB(ZIP) 

grown in ECLB plus 10 ^ig/ml kanamycin, 0.2% maltose, and 10 mM MgS0 4 
and incubated at 37 degrees for 15 minutes. 0.9 ml SOC was added and 100 ^1 
of the bacteria immediately plated on each of 10 ECLB + 50 ^g Pen plates. No 
45 minute recovery time was needed. The plates were incubated overnight at 37 

20 degrees. Colonies were picked into ECLB + 50 jig Pen media for overnight 
cultures to be used for making glycerol stocks and miniprep DNA. An aliquot 
of the culture used for the miniprep is stored as a glycerol stock7 Plating on 
ECLB + 50 fig Pen/ml resulted in more colonies and a greater proportion of 
colonies containing inserts than plating on 100 |ig/ml Pen. 

25 Random colonies were picked and plasmid DNA purified using Qiagen 

miniprep kits. DNA sequence was obtained from the 5 f end of the cDNA insert 
and compared to the databases using the BLAST algorithm. Ma524 was 
identified as a putative A6 desaturase based on DNA sequence homology to 
previously identified A6 desaturases. A full-length cDNA clone was isolated 
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from the M. alpina library. The abundance of this clone appears to be slightly 
(2X) less than Ma29. Ma524 displays significant homology to a portion of a 
Caenorhabditis elegans cosmid, W06D2.4, a cytochrome b5/desaturase fusion 
protein from sunflower, and the two A6 desaturases in the public databanks 
5 those from Synechocystis and Spirulina. 

In addition, Ma524 shows significant homology to the borage A6- 
desaturase sequence (PCT publication WO 96/21022). Ma524 thus appears to 
encode a A6-desaturase that is related to the borage and algal A6-desaturases. It 
should be noted that, although the amino acid sequences of Ma524 and the 

1 0 borage A6 are similar, the base composition of the cDNAs is quite different: the 
borage cDNA has an overall base composition of 60 % A+T, with some regions 
exceeding 70 %, while Ma524 has an average of 44 % A+T base composition, 
with no regions exceeding 60 %. This may have implications for expressing the 
cDNAs in microorganisms or animals which favor different base compositions. 

15 It is known that poor expression of recombinant genes can occur when the host 
has a very different base composition from that of the introduced gene. 
Speculated mechanisms for such poor expression include decreased stability or 
translatability of the mRNA. 

Example 3 

20 Identifica tion of A6-desatu rases Homologous 

to the Mortierella alpina A6-desaturase 

Nucleic acid sequences that encode putative A6-desaturases were 

identified through a BLASTX search of the est databases through NCBI using 

the Ma524 amino acid sequence. Several sequences showed significant 

25 homology. In particular, the deduced amino acid sequence of two Arabidopsis 
thaliana sequences, (accession numbers F 13728 and T42806) showed 
homology to two different regions of the deduced amino acid sequence of 
Ma524. The following PCR primers were designed: ATTS4723-FOR 
(complementary to F13728) 5*-CUACUACUACUAGGAGTCCTCTA 

30 CGGTGTTTTG, SEQ ID NO:22, and T42806-REV (complementary to 
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T42806) 5' CAUCAUCAUCAUATGATGCTCAAGCTGAAACTG, SEQ ID 
NO:23. Five \xg of total RNA isolated from developing siliques of Arabidopsis 
thaliana was reverse transcribed using BRL Superscript RTase and the primer 



5 NO:24). PCR was carried out in a 50 ul volume containing: template derived 
from 25 ng total RNA, 2 pM each primer, 200 \xM each deoxyribonucleotide 
triphosphate, 60 mM Tris-Cl, pH 8.5, 15 mM (NH^SO* 2 mM MgCl 2 , 0.2 U 
Taq Polymerase. Cycle conditions were as follows: 94 degrees for 30 sec, 50 
degrees for 30 sec, 72 degrees for 30 sec. PCR was continued for 35 cycles 

10 followed by an additional extension at 72 degrees for 7 minutes. PCR resulted 
in a fragment of -750 base pairs which was subsequently subcloned, named 12- 
5, and sequenced. Each end of this fragment corresponds to the Arabidopsis 
est from which the PCR primers were derived. This is the sequence named 12-5. 
The deduced amino acid sequence of 12-5 is compared to that of Ma524 and 

15 ests from human (W28140), mouse (W53753), and C. elegans (R05219) in 
Figure 4. Based on homology, these sequences represent desaturase 
polypeptides. The full-length genes can be cloned using probes based on the est 
sequences. The genes can then be placed in expression vectors and expressed in 
host cells and their specific A6- or other desaturase activity can be determined 

20 as described below. 



Isolation of A-12 Desaturase Nucleotide Sequence from Mortierella alpina 
Based on the fatty acids it accumulates, Mortierella alpina has an to6 
type desaturase. The co6 desaturase is responsible for the production of linoleic 
25 acid ( 1 8 :2) from oleic acid (18:1). Linoleic acid ( 1 8 :2) is a substrate for a A6 
desaturase. This experiment was designed to determine if Mortierella alpina 
has a A12-desaturase polypeptide, and if so, to identify the corresponding 
nucleotide sequence. A random colony from the M. alpina sequencing grade 
library, Ma648, was sequenced and identified as a putative desaturase based on 
30 DNA sequence homology to previously identified desaturases, as described for 



TSyn 5-CCAAGCTTCTGCAGGAGCTl 




-3', (SEQ ID 



Example 4 
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Ma524 (see Example 2). The deduced amino acid sequence from the 5* end of 
the Ma648 cDNA displays significant homology to soybean microsomal to6 
(A12) desaturase (accession #L43921) as well as castor bean oleate 12- 
hydroxylase (accession #U22378). In addition, homology is observed to a 
5 variety of other ©6 (A12) and t»3 (Al 5) fatty acid desaturase sequences. 

Example 5 

Isolation of Cytochrome b5 Reductase Nucleotide Sequence 
from MortiereUa alpina 

A nucleic acid sequence encoding a cytochrome b5 reductase from 

1 0 MortiereUa alpina was obtained as follows. A cDNA library was constructed 

based on total RNA isolated from MortiereUa alpina as described in Example 1 . 

DNA sequence was obtained from the 5' and 3' ends of one of the clones, M12- 

27. A search of public databanks with the deduced amino acid sequence of the 

3' end of Ml 2-27 (see Figure 5) revealed significant homology to known 

1 5 cytochrome b5 reductase sequences. Specifically, over a 49 amino acid region, 

the MortiereUa clone shares 55% identity (73% homology) with a cytochrome 

b5 reductase from pig (see Figure 4). 

Example 6 

Expression of M. alpina Desa turase Clones in Baker's Yeast 
20 Yeast Transformation 

Lithium acetate transformation of yeast was performed according to 
standard protocols (Methods in Enzymology, Vol. 194, p. 186-1 87, 1991). 
Briefly, yeast were grown in YPD at 30°C. Cells were spun down, resuspended 
in TE, spun down again, resuspended in TE containing 100 mM lithium acetate, 
25 spun down again, and resuspended in TE/lithium acetate. The resuspended 
yeast were incubated at 30°C for 60 minutes with shaking. Carrier DNA was 
added, and the yeast were aliquoted into tubes. Transforming DNA was added, 
and the tubes were incubated for 30 min. at 30°C. PEG solution (35% (w/v) 
PEG 4000, 100 mM lithium acetate, TE pH7.5) was added followed by a 50 
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min. incubation at 30°C. A 5 min. heat shock at 42°C was performed, the cells 
were pelleted, washed with TE, pelleted again and resuspended in TE. The 
resuspended cells were then plated on selective media. 

Desaturasc Expression in Transformed Yeast 

5 cDNA clones from Mortierella alpina were screened for desaturase 

activity in baker's yeast. A canola A 1 5-desaturase (obtained by PCR using 1 st 
strand cDNA from Brassica napus cultivar 212/86 seeds using primers based on 
the published sequence (Arondel et al Science 258:1353-1355)) was used as a 
positive control. The A 1 5-desaturase gene and the gene from cDNA clone 

1 0 Ma29 was put in the expression vector p YES2 (Invitrogen), resulting in 

plasmids pCGR-2 and pCGR-4, respectively. These plasmids were transfected 
into S. cerevisiae yeast strain 334 and expressed after induction with galactose 
and in the presence of substrates that allowed detection of specific desaturase 
activity. The control strain was S. cerevisiae strain 334 containing the unaltered 

1 5 p YES2 vector. The substrates used, the products produced and the indicated 
desaturase activity were: DGLA (conversion to ARA would indicate A5- 
desaturase activity), linoleic acid (conversion to GLA would indicate A6- 
desaturase activity; conversion to ALA would indicate A 1 5-desaturase activity), 
oleic acid (an endogenous substrate made by S. cerevisiae, conversion to 

20 linoleic acid would indicate A12-desaturase activity, which S. cerevisiae lacks), 
or ARA (conversion to EPA would indicate Al 7-desaturase activity). The 
results are provided in Table 1 below. The lipid fractions were extracted as 
follows: Cultures were grown for 48-52 hours at 15°C. Cells were pelleted by 
centrifugation, washed once with sterile ddH 2 0, and repelleted. Pellets were 

25 vortexed with methanol; chloroform was added along with tritridecanoin (as an 
internal standard). The mixtures were incubated for at least one hour at room 
temperature or at 4°C overnight. The chloroform layer was extracted and 
filtered through a Whatman filter with one gram of anhydrous sodium sulfate to 
remove particulates and residual water. The organic solvents were evaporated 

30 at 40°C under a stream of nitrogen. The extracted lipids were then derivatized 
to fatty acid methyl esters (FAME) for gas chromatography analysis (GC) by 



WO 98/46764 



PCT/US98/07421 



adding 2 ml of 0.5 N potassium hydroxide in methanol to a closed tube. The 
samples were heated to 95°C to 100°C for 30 minutes and cooled to room 
temperature. Approximately 2 ml of 14 % boron trifluoride in methanol was 
added and the heating repeated. After the extracted lipid mixture cooled, 2 ml 
5 of water and 1 ml of hexane were added to extract the FAME for analysis by 
GC. The percent conversion was calculated by dividing the product produced 
by the sum of (the product produced and the substrate added) and then 
multiplying by 100. To calculate the oleic acid percent conversion, as no 
substrate was added, the total linoleic acid produced was divided by the sum of 
10 (oleic acid and linoleic acid produced), then multiplying by 100. 
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Table 1 

M giving Desaturase Expression in Baker's Yeast 



CLONE TYPE OF ENZYME % CONVERSION 

ACTIVITY OF SUBSTRATE 

pCGR-2 A6 0 (18:2 to 18:3o6) 

(canolaA15 A15 16.3 (18:2 to 18:3co3) 

desaturase) A5 2.0 (20:3 to 20:4o>6) 

A17 2.8 (20:4to20:5©3) 

A12 1.8 (18:1 to 18:2©6) 

pCGR-4 A6 0 

(M. alpina A15 0 

A6-like, Ma29) A5 15.3 

A17 0.3 

A12 3.3 

pCGR-7 A6 0 

(M. alpina A15 3.8 

A12-like, Ma648 A5 2.2 

A17 0 

A12 63.4 



The A15-desaturase control clone exhibited 16.3% conversion of the 
5 substrate. The pCGR-4 clone expressing the Ma29 cDNA converted 15.3% of 
the 20:3 substrate to 20:4w6, indicating that the gene encodes a A5-desaturase. 
The background (non-specific conversion of substrate) was between 0-3% in 
these cases. The pCGR-5 clone expressing the Ma524 cDNA showed 6% 
conversion of the substrate to GLA, indicating that the gene encodes a A6- 
1 0 desaturase. The pCGR-7 clone expressing the Ma648 cDNA converted 63.4% 
conversion of the substrate to LA, indicating that the gene encodes a A12- 
desaturase. Substrate inhibition of activity was observed by using different 
concentrations of the substrate. When substrate was added to 100 ^iM, the 
percent conversion to product dropped as compared to when substrate was 
1 5 added to 25 |iM (see below). These data show that desaturases with different 
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substrate specificities can be expressed in a heterologous system and used to 
produce PUFAs. 

Table 2 represents fatty acids of interest as a percent of the total lipid 
extracted from the yeast host S. cerevisiae 334 with the indicated plasmid. No 
5 glucose was present in the growth media. Affinity gas chromatography was 
used to separate the respective lipids. GC/MS was employed to verify the 
identity of the product(s). The expected product for the B. napus A15- 
desaturase, a-linolenic acid, was detected when its substrate, linoleic acid, was 
added exogenously to the induced yeast culture. This finding demonstrates that 

1 0 yeast expression of a desaturase gene can produce functional enzyme and 
detectable amounts of product under the current growth conditions. Both 
exogenously added substrates were taken up by yeast, although slightly less of 
the longer chain PUFA, dihomo-y-linolenic acid (20:3), was incoiporated into 
yeast than linoleic acid (18:2) when either was added in free form to the induced 

1 5 yeast cultures, y-linolenic acid was detected when linoleic acid was present 
during induction and expression of S. cerevisiae 334 (pCGR-5). The presence 
of this PUFA demonstrates A6-desaturase activity from pCGR-5 (MA524). 
Linoleic acid, identified in the extracted lipids from expression of S. cerevisiae 
334 (pCGR-7), classifies the cDNA MA648 from M. alpina as the A12- 

20 desaturase. 
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Example 7 

Expression of A5 Desaturase in Plants 
Expression in Leaves 

This experiment was designed to determine whether leaves expressing 
5 Ma29 (as determined by Northern) were able to convert exogenously applied 
DGLA (20:3) to ARA (20:4). 

The Ma29 desaturase cDNA was modified by PCR to introduce 
convenient restriction sites for cloning. The desaturase coding region has been 
inserted into a d35 cassette under the control of the double 35S promoter for 
1 0 expression in Brassica leaves (pCGN5525) following standard protocols (see 
USPN 5,424,200 and USPN 5,106,739). Transgenic Brassica plants containing 
pCGN5525 were generated following standard protocols (see USPN 5,188,958 
and USPN 5,463,174). 

In the first experiment, three plants were used: a control, LP004-1 , and 
1 5 two transgenics,, 5525-23 and 5525-29. LP004 is a low-linolenic Brassica 
variety. Leaves of each were selected for one of three treatments: water, GLA 
or DGLA. GLA and DGLA were purchased as sodium salts from NuChek Prep 
and dissolved in water at 1 mg/ml. Aliquots were capped under N 2 and stored at 
-70 degrees C. Leaves were treated by applying a 50 ul drop to the upper 
20 surface and gently spreading with a gloved finger to cover the entire surface. 
Applications were made approximately 30 minutes before the end of the light 
cycle to minimize any photo-oxidation of the applied fatty acids. After 6 days 
of treatment one leaf from each treatment was harvested and cut in half through 
the mid rib.- One half was washed with water to attempt to remove 
25 unincorporated fatty acid. Leaf samples were lyophilized overnight, and fatty 
acid composition determined by gas chromatography (GC). The results are 
shown in Table 3. 
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Leaves treated with GLA contained from 1 .56 to 2.4 wt% GLA. The fatty acid 
analysis showed that the lipid composition of control and transgenic leaves was 
essentially the same. Leaves of control plants treated with DGLA contained 
1.2-1.9 w% DGLA and background amounts of ARA (.26-.27 wt%). 
5 Transgenic leaves contained only .2-.7 wt% DGLA, but levels of ARA were 
increased (.74-1 . 1 wt%) indicating that the DGLA was converted to ARA in 
these leaves. 

Expression in Seed 



1 0 with the seed specific napin promoter would enable expression in seed. 

The Ma29 cDNA was modified by PCR to introduce Xhol cloning sites 
upstream and downstream of the start and stop codons, respectively, using the 
following primers: 

Madxho-forward: 



(SEQ IDNO:25) 
Madxho-reverse: 

5 f -CAUCAUCAUCAUCTCGAGCTACTCTTCCTTGGGACGGAG 
(SEQ IDNO:26). 



CloneAmp system (GIBCOBRL) to create pCGN5522 and the A5 desaturase 
sequence was verified by sequencing of both strands. 

For seed-specific expression, the Ma29 coding region was cut out of 
pCGN5522 as an Xhol fragment and inserted into the Sail site of the napin 
25 expression cassette, pCGN3223, to create pCGN5528. The //wdlll fragment of 
pCGN5528 containing the napin 5' regulatory region, the Ma29 coding region, 
and the napin 3' regulatory region was inserted into the Hindlll site of 
pCGN1557 to create pCGN5531. Two copies of the napin transcriptional unit 
were inserted in tandem. This tandem construct can permit higher expression of 



The purpose of this experiment was to determine whether a construct 



15 



5-CUACUACUACUACTCGAGCAAGATGGGAACGGACCAAGG 



20 



The PCR product was subcloned into pAMPl (GIBCOBRL) using the 
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the desaturases per genetic loci. pCGN553 1 was introduced into Brassica 
napus cv.LP004 via Agrobacterium mediated transformation. 

The fatty acid composition of twenty-seed pools of mature T2 seeds was 
analyzed by GC. Table 4 shows the results obtained with independent 
5 transformed lines as compared to non-transformed LP004 seed. The transgenic 
seeds containing pCGN553 1 contain two fatty acids that are not present in the 
control seeds, tentatively identified as taxoleic acid (5,9-18:2) and pinolenic 
acid (5,9,12-18:3), based on their elution relative to oleic and linoleic acid. 
These would be the expected products of A5 desaturation of oleic and linoleic 
10 acids. No other differences in fatty acid composition were observed in the 
transgenic seeds. 
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Northern analysis is performed on plants to identify those expressing 
Ma29. Developing embryos are isolated approximately 25 days post anthesis or 
when the napin promoter is induced, and floated in a solution containing GLA 
or DGLA as described in Example 7. Fatty acid analysis of the embryos is then 
5 performed by GC to determine the amount of conversion of DGLA to ARA, 

following the protocol adapted for leaves in Example 7. The amount of ARA 
incorporated into triglycerides by endogenous Brassica acyltransferases is then 
evaluated by GC analysis as in Example 7. 

Example 8 

10 Expressio n of M. alpina A6 Desaturase in Brassica napus 

The Ma524 cDNA was modified by PCR to introduce cloning sites 
using the following primers: 



Ma524PCR-l (SEQ IDNO:27) 

15 5'-CUACUACUACUATCTAGACTCGAGACCATGGCTGCTGCT 
CCAGTGTG 



Ma524PCR-2 (SEQ ID NO:28) 

5-CAUCAUCAUCAUAGGCCTCGAGTTACTGCGCCTTACCCAT 

These primers allowed the amplification of the entire coding region and 
added Xbal an&Xhol sites to the 5'-end and Afcol and Stul sites to the 3' end. 
The PCR product was subcloned into pAMPl (GIBCOBRL) using the 
CloneAmp system (GIBCOBRL) to create pCGN5535 and the A6 desaturase 
sequence was verified by sequencing of both strands. 

For seed-specific expression, the Ma524 coding region was cut out of 
pCGN5535 as an ATioI fragment and inserted into the Sail site of the napin 
expression cassette, pCGN3223, to create P CGN5536. The Notl fragment of 
pCGN5536 containing the napin 5* regulatory region, the Ma524 coding region, 
and the napin 3' regulatory region was inserted into the Notl site of pCGN1557 
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to create pCGN5538. pCGN5538 was introduced into Brassica napus 
cv.LP004 via Agrobacterium mediated transformation. 

Maturing T2 seeds were collected from 6 independent transformation 
events in the greenhouse. The fatty acid composition of single seeds was 
5 analyzed by GC. Table 5 shows the results of control LP004 seeds and six 5538 

lines. All of the 5538 lines except #8 produced seeds containing GLA. 
Presence of GLA segregated in these seeds as is expected for the T2 selfed seed 
population. In addition to GLA, the M alpina A6 desaturase is capable of 
producing 18:4 (stearidonic) and another fatty acid believed to be the 6,9-18:2. 

1 0 The above results show that desaturases with three different substrate 

specificities can be expressed in a heterologous system and used to produce 
poly-unsaturated long chain fatty acids. Exemplified were the production of 
ARA (20:4) from the precursor 20:3 (DGLA), the production of GLA (18:3) 
from 18:2 substrate, and the conversion of 18:1 substrate to 18:2, which is the 

1 5 precursor for GLA. 
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Example 9 

Expression of M. alpina A12 desaturase in Brassica nanus 
The Ma648 cDNA was modified by PCR to introduce cloning sites 
using the following primers: 

5 Ma648PCR-for (SEQ ID NO:29) 

S'-CUACUACUACUAGGATCCATGGCACCTCCCAACACT 

Ma648PCR-rev (SEQ IDNO:30) 

5-CAUCAUCAUCAUGGTACCTCGAGTTACTTCTTGAAAAAGAC 

These primers allowed the amplification of the entire coding region and 
1 0 added a BamHI site to the 5' end and Kpnl and Xhol sites to the 3' end. The 

PCR product was subcloned into pAMPl (GIBCOBRL) using the CloneAmp 
system (GIBCOBRL) to create pCGN5540 and the A12 desaturase sequence 
was verified by sequencing of both strands. 

For seed-specific expression, the Ma648 coding region was cut out of 
1 5 pCGN5540 as a BamHI/XhoI fragment and inserted between the BgUI and 

Xhol sites of the napin expression cassette, pCGN3223, to create pCGN5542. 
The Asp718 fragment of pCGN5541 containing the napin 5' regulatory region, 
the Ma648 coding region, and the napin 3 1 regulatory region was inserted into 
the Asp718 site of pCGN5138 to create pCGN5542. PCGN5542 was 
20 introduced into two varieties of Brassica napus via Agrobacterium mediated 

transformation. The commercial canola variety, SP3 0021, and a low-linolenic 
line, LP30108 were used. 

Mature selfed T2 seeds were collected from 19 independent LP30108 
transformation events and a non-transformed control grown in the greenhouse. 
25 These seeds are expected to be segregating for the A 12 desaturase transgene. 

The fatty acid composition of 20-seed pools was analyzed by GC. The results 
are shown in Table 6. All transformed lines contained increased levels of 18:2, 
the product of the Al 2 desaturase. Levels of 18:3 were not significantly 
increased in these plants. Events # 1 1 and 16 showed the greatest accumulation 
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of 18:2 in the pooled seeds. To investigate the segregation of 18:2 levels in the 
T2 seeds and to identify individual plants to be taken on to subsequent 
generations, half-seed analysis was done. Seeds were germinated overnight in 
the dark at 30 degrees on water-soaked filter paper. The outer cotyledon was 
5 excised for GC analysis and the rest of the seedling was planted in soil. Results 

of some of these analyses are shown in Table 7. Individual T2 seeds containing 
the A£ alpina A12 desaturase accumulated up to 60% 18:2 in the seeds. Sample 
97xxl 116 #59 is an example of a null segregant. Even in the highest 18:2 
accumulators, levels of 18:3 were increased only slightly. These and other 
10 individually selected T2 plants were grown in the greenhouse and in the field to 

produce T3 seed. 

Mature selfed T2 seeds were collected from 20 independent SP30021 
transformation events and a non-transformed control grown in the greenhouse. 
These seeds are expected to be segregating for the A 12 desaturase transgene. 

1 5 The fatty acid composition of 20-seed pools was analyzed by GC. The data are 

presented in Table 8. All transformed lines contained increased levels of 18:2, 
the product of the A12 desaturase. As in the low-linolenic LP30108 line, levels 
of 18:3 were not significantly increased. Events # 4 and 12 showed the greatest 
accumulation of 18:2 in the pooled seeds. To investigate the segregation of 

20 1 8:2 levels in the T2 seeds and to identify individual plants to be taken on to 

subsequent generations, alf-seed analysis was done. Seeds were germinated 
overnight in the dark at 30 degrees on water-soaked filter paper. The outer 
cotyledon was excised for GC analysis and the rest of the seedling was planted 
in soil. Results of some of these analyses are shown in Table 9. Samples 

25 97xxl 157 #88 and #18 are examples of null segregants for 5542-SP30021-4 and 

5542-SP30021-12 respectively. These and other individually selected T2 plants 
were grown in the greenhouse and in the field to produce T3 seed 
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Example 10 

Simultaneous expression of M alpina A6 and A 12 
desaturases in Brassica nanus 

5 In order to express the M. alpina A6 and A12 desaturases from the same 

T-DNA, the following construct for seed-specific expression was made. 

The NotI fragment of pCGN5536 containing the containing the napin 5' 
regulatory region, the Ma524 coding region, and the napin 3' regulatory region 
was inserted into the NotI site of pCGN5542 to create pCGN5544. The 
10 expression modules were oriented in such a way that the direction of 

transcription from Ma524 and Ma648 and the nptll marker is the same. 

PCGN5544 was introduced into Brassica napus cv.LP30108 via 
Agrobacterium mediated transformation. Mature selfed T2 seeds were collected 
from 16 independent LP30108 transformation events and a non-transformed 
1 5 control that were grown in the greenhouse. These seeds are expected to be 

segregating for the A6+ A 12 desaturase transgene. The fatty acid composition 
of 20-seed pools was analyzed by GC. The results are presented in Table 10. 
All but one of the lines (5544-LP30108-3) shows an altered oil composition as 
compared to the controls. GLA was produced in all but three of the lines (-3, -4, 
20 -11); two of the three without GLA ( -4, -1 1) showed increased 18:2 indicative 

of expression of the A 12 desaturase. As a group, the levels of GLA observed in 
plants containing the double A6 + A12 construct (pCGN5544) were higher than 
those of plants containing pCGN5538 (A6 alone). In addition, levels of the A 6,9 
1 8:2 are much reduced in the plants containing the A12 + A6 as compared to A6 
25 alone. Thus, the combination of A6 and A 12 desaturases on one T-DNA leads 

to the accumulation of more GLA and fewer side products than expression of 
A6 desaturase alone. To investigate the segregation of GLA levels in the T2 
seeds and to identify individual plants to be taken on to subsequent generations, 
half-seed analysis was done. Seeds were germinated overnight in the dark at 30 
30 degrees on water-soaked filter paper. The outer cotyledon was excised for GC 

analysis and the rest of the seedling was planted in soil. Results of some of 
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these analyses are shown in Table 1 1. As expected for the T2 population, levels 
of GLA and 18:2 are segregating in the individual seeds. GLA content of up to 
60% of total fatty acids was observed in individual seeds. Individual events 
were selected to be grown in the greenhouse and field for production of T3 
5 seed. 

Transgenic plants including Brassica, soybean, safflower, corn flax and 
sunflower expressing the constructs of this invention can be a good source of 
GLA. 

Typical sources of GLA such as borage produce at most 25% GLA. In 
0 contrast the plants in Table 1 0 contain up to 30% GLA. Furthermore, the 

individual seeds shown in Table 1 1 contain up to 60% GLA. 
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Example 11 

Simultaneous expr ession of M. alnina AS and Afi 
desaturases in Brassica napus 

In order to produce arachadonic acid (ARA) in transgenic canola oil 

both A5 and A6 desaturase activities need to be introduced. In order to facilitate 

downstream characterization and breeding, it may be advantageous to have both 

activities encoded by a single T-DNA. The following example illustrates the 

simultaneous expression of A5 and A6 desaturases. 

The Asp718 fragment of pCGN5528 containing the napin 5* regulatory 
region, the Ma29 coding region, and the napin 3' regulatory region was inserted 
into the Asp718 site of pCGN5138 to create pCGN5545. The NotI fragment of 
pCGN5536 containing the napin 5' regulatory region, the Ma524 coding region, 
and the napin 3' regulatory region was inserted into the NotI site of pCGN5545 
to create pCGN5546. The expression modules were oriented in such a way that 
the direction of transcription from Ma524 and Ma29 and the nptll marker is the 
same. 

PCGN5546 was introduced into Brassica napus cv.LP30108 via 
Agrobacterium mediated transformation. Mature selfed T2 seeds were collected 
from 30 independent LP30108 transformation events that were grown in the 
greenhouse. The fatty acid composition of 20-seed pools was analyzed by GC. 
The results are shown in Table 12. All the lines show expression of both 
desaturases as evidenced by the presence of A 5,9 18:2 (as seen in pCGN5531 
plants) and A 6,9 18:2 and GLA (as seen in pCGN5538 plants) 
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Example 12 

Simultaneous expression of M. alvina AS, A6 and A12 
desaturases in Brassica nanus 

5 In order to achieve optimal production of ARA in transgenic canola oil 

both the A6 and A12 desaturase activities may need to be present in addition to 
the A5 activity. In order to facilitate downstream characterization and breeding, 
it may be advantageous to have all of these activities encoded by a single T- 
DNA. The following example illustrates the simultaneous expression of A5, A6 

10 and A 12 desaturases. 

The Hindlll fragment of pCGN5528 containing the napin 5' regulatory 
region, the Ma29 coding region, and the napin 3 f regulatory region was inserted 
into the Hindlll site of pCGN5544 to create pCGN5547. The expression 
modules were oriented in such a way that the direction of transcription from 
1 5 Ma29, Ma524, Ma648 and the nptll marker is the same. 

PCGN5547 was introduced into Brassica napus cv.LP30108 via 
Agrobacterium mediated transformation. Mature selfed T2 seeds were collected 
from 30 independent LP30108 transformation events that were grown in the 
greenhouse. The fatty acid composition of 20-seed pools was analyzed by GC. 

20 The results are shown in Table 13. Twenty-seven of the lines show significant 

accumulation of GLA and in general the levels of GLA observed are higher 
than those seen in the 5546 plants that did not contain the A 12 desaturase. The 
A 12 desaturase appears to be active in most lines as evidenced by the lack of 
detectable A6,9 18:2 and elevated 18:2 levels in most plants. Small amounts of 

25 A5,9 1 8:2 are seen in the 5547 plants, although the levels are generally less than 

those observed in the 5546 plants. This may be due to the presence of the A 12 
desaturase which efficiently converts the 18:1 to 18:2 before it can be 
desaturated at the A5 position. 
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Example 13 

Stereospecific Distribution of A6-Desaturated Oils 

This experiment was designed to investigate the stereospecific 
distribution of the A6-desaturated oils in seeds expressing pCGN5538 (Ma 524 
5 cDNA). Three seed samples were used: 

1) Non-transformed B. napus cv. LP004 seeds (control) 

2) Segregating T2 seeds of pCGN5538-LP004-19 

3) Segregating T2 seeds of pCGN5538-LP004-29 
The following protocol was used for the analysis: 

10 1 . Seed Oil Extraction 

Fifty seeds were placed in a 12 x 32 mm vial and crushed with a glass 
rod. 1 .25 mL hexane was added and the mixture was vortexed. The seeds were 
extracted overnight on a shaker. The extract was then filtered through a 0.2 
micron filter attached to a lcc syringe. The extract was then dried down under 
15 nitrogen. The resulting oil was used for digestion and derivatization of the 

whole oil sample. 

2. Digestion 

A. Liquid Oil Digestion 

The stock lipase (from Rhizopus arrhizus, Sigma, L4384) was diluted to 
20 approximately 600,000 units/mL with a goal of obtaining 50% digestion of the 

TAG. The stock lipase is maintained at 4 degrees C and placed on ice. The 
amount of reagents may be adjusted according to the amount of oil to be 
digested. 

The following amounts are based on a 2.0 mg extracted oil sample. In a 
25 12 x 32 mm screw cap vial the following were added: 2.0 mg oil, 200 \xL 0.1 M 

tris HCI pH 7, 40 \iL 2.2 w/v% CaCl 2 2H 2 0, and 100 \xL 0.05 w/v % bile salts. 
The material was vortexed and sonicated to disperse the oil. Twenty \xL of 
diluted lipase was added and the mixture was vortexed continuously for 1 .0 
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minute at room temperature. A white precipitate formed. The reaction was 
stopped with 100 uL 6M HC1 and vortexing. Five hundred uL CHCl 3 :CH 3 OH 
(2:1) was added and the mixture was vortexed and held on ice while reaining 
digestions were carried out. Samples were vortexed again and centrifuged 

5 briefly to sharpen layers. The lower layer containing digest products was 

removed with a pasteur pipette and placed in a 12 x 32 mm crimp cap vial. The 
material was then re-extracted with 300 [iL CHCh, vortexed, centrifuged, and 
combined with the lower layers. The digest products were kept on ice as much 
as possible. HPLC separation is performed as soon as possible after digestion to 

10 minimize acyl migration. 

B. Solid Fat Digestion 

The procedure for liquid oil digestion described above was followed 
except that 20 1 1 :0 methyl ester is added to 2.0 mg solid fat. 

3. HPLC Separation 

1 5 The digestion products were dried down in chloroform to approximately 

200 \iL. Each sample was then transferred into an insert in an 8 x 40 mm shell 
vial and 30 jxL was injected for HPLC analysis. 

The high performance liquid chromatographic system was equipped 
with a Varex ELSD IIA evaporative light scattering detector with tube 
20 temperature at 105°C and nitrogen gas flow at 40 mL/min; a Waters 712 Wisp 

autosampler, three Beckman 1 14M Solvent Delivery Modules; a Beckman 
421 A controller, a Rheodyne pneumatically actuated stream splitter; and a 
Gilson micro fractionator. The chromatography column is a 220 x 4.6 mm, 5 
micron normal phase silica cartridge by Brownlee. 

25 The three solvents used were: 

A= hexane: toluene 1:1 

B= toluene: ethyl acetate 3:1 

C= 5% formic acid in ethyl acetate 
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The gradient profile was as follows: 



Time (min) 


Function 


Value 


Duration 


Oflow 


2.0 mlVmin 






0%B 


10 






0%C 


2 






2%C 


25 




6 min 


14.0 % C 


2 




1 min 


15.0 


End program 







A chromatographic standard mixture is prepared in hexane: toluene 1:1 
containing the following: 

0.2 mg/mL triglyceride 16:0 



5 2.0 mg/mL 16:0 Free Fatty Acid 

0.2 mg/mL dil6:0 mixed isomers (1,2-diacylglycerol and 1,3-diacylglycerol) 
0.2 mg/mL 3-mono acylglycerol 16:0 
0.2 mg/mL 2-mono acylglycerol 16:0 

For each sample, the fraction containing the 2-mag peak is collected 
10 automatically by method controlled timed events relays. A time delay is used to 

synchronize the detector with the collector's emitter. The 2-mag peaks are 
collected and the fractions are evaporated at room temperature overnight. 

The sn-2 composition results rely on minimization of acyl migration. 
Appearance of 1-monoacylglycerol and/or 3-monoacylglycerol peaks in the 
15 chromatograph means that acyl migration has occurred. 

4. Derivatization 

To derivatize the whole oil, 1.0 mg of the extracted whole oil was 
weighed into a 12 x 32 mm crimp cap vial. One mL toluene was then added. 
The sample is then vortexed and a 50 \xL aliquot was removed for 
20 derivatization. To the dried down 2-mag samples, 50 jiL toluene was added. To 

both the whole oil and 2-mag fractions 105 uL H2SO4/CH3OH @ 8.76 wt% is 
added. The cap was tightly capped and the sample is refluxed for 1 hour at 95 
degrees C. The sample was allowed to cool and 500 uL 10 w/v % NaCl in 
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water and 60 uL heptane was added. The organic layer was removed and 
inserted in a 12 x 32 mm crimp cap vial. 

5. GLC Analysis 

A Hewlett Packard model 6890 GC equipped with a split/splitless 
5 capillary inlet, FID detector, 6890 series autosampler and 3392A Alpha Omega 

integrator is set up for the capillary column as follows: 

A. Supelco Omegawax 250, 30 m length, 0.25 mm id, 0.25 urn film 
thickness 



injection port: 


260 C 


detector: 


270 C 


initial temp: 


170 C 


initial time: 


1.5 min 


rate: 


30 deg/min 


final temp: 


245 C 


final time: 


6.5 min 


injection vol: 


1.5 uL 


head pressure: 


25 psi 


split ratio: 


30 


carrier gas: 


He 


make-up gas: 


N 2 


FID gas: 


H + air 



Percent compositions of fatty acid methyl esters are calculated as mole 
percents. For carbon chain lengths less than 12, the use of theoretical or 
25 empirical response factors in the area percent calculation is desirable. 
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6. Calculations 

The mean distribution of each acyl group at each sn-l and sn-3 position 
was calculated. 

mean sn-l and sn-3 composition = (3 WO comp - MAG comp) / 2 
5 WO = whole oil 

MAG= monoacylglycerol 

The results of this analysis are presented in Table 14. The GLA and A 6,9 
18:2 are evenly distributed between the sn-2 and jn-1, 3 positions. This 
analysis can not discriminate between fatty acids in the sn-l vs. sn-3 positions. 
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Example 14 

Fattv Acid Compositions of Transgenic Plants 

A5 and A6 transgenic plants were analyzed for their fatty acid content. 

The following protocol was used for oil extraction: 

1 . About 400 mg of seed were weighed out in duplicate for each 
sample. 

2. The seeds were crushed in a motar and pestle. The mortar and 
pestle was rinsed twice with 3ml (2:1) (v:v) 
CHCl 3 :CH 3 OH/MeOH. An additional 6 ml (2:1) was added to 
the 20ml glass vial (oil extracted in 12ml total 2:1). 

3. Samples were vortexed and placed on an orbital shaker for 2 
hours with occasional vortexing. 

4. SmloflMNaCl was added to each sample. Sample was 
vortexed then spun in centrifuge at 2000rpm for 5 minutes. 
Lower phase was drawn off using a pasteur pipette. 

5. Upper phase was re-extracted with an additional 5ml. Sample 
was vortexed then spun in centrifuge at 2000 rpm for 5 minutes. 
The lower phase was drawn off using a pasteur pipette and added 
to previous lower phase. 

6. CHCl3:CH30H /MeOH was evaporated under nitrogen using 
evaporative cooling. Vial containing extracted oil was sealed 
under nitrogen. Between 120mg- 160mg oil was extracted for 
each sample. 

For GC-MS analysis, fatty acid methyl esters were dissolved in an 
appropriate volume of hexane and analyzed using a Hewlett-Packard 5890 
Series II Plus gas chromatograph (Hewlett Packard, Palo Alto, CA) equipped 
with a 30 m x 0.32 mm i.d. Omegawax 320 fused sillica capillary column 
(Supelco, Bellefonte, PA) and a Hewlett-Packard 5972 Series mass selective 
detector. Mass spectra were intrepreted by comparison to the mass spectra in 
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NIST/EPA/NIH Chemical Structure Database using a MS Chem Station 
(#G1036A) (Hewlett Packard). 

Transgenic line 5531-6 was analyzed in duplicate (A, B) and compared 
to control line LP004-6. The fatty acid profile results are shown in Table 15. 

5 Transgenic line 5538-19 was analyzed in duplicate (A, B) and compared 

to control line LP004-6. The fatty acid profile results are shown in Table 16. 
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Table 15 
Fatty Acid Profile 
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0.060 
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C16:3 
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Table 15 
Fattv Acid Profile 
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TOTAL 


100.00 


100.00 


100.00 


100.00 
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Table 16 
Fatty Acid Profile 
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Table 16 
Fattv Acid Profile 
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TOTAL 


100.00 


100.00 


100.00 


100.00 
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Example 15 

Combined Expression of A6 and A12 
Desaturases in B. nap us Achieved by Crossing 

Plants containing either the A6 or the A 12 desaturase were crossed and 

5 individual Fl half-seeds were analyzed for fatty acid composition by GC. Data 

from one such cross are given in Table 17. The parents for the cross were 

5538-LP004-25-2-25 (A6 expressor) and 5542-SP30021-10-16 (A12 expressor). 

Reciprocal crosses were made and the results of 25 individual Fl seeds of each 

are shown in the table. Crosses are described such that the first parent indicated 

10 is the female. Both sets of crosses gave approximately the same results. 

Compared to the parents, the A 6 ' 9 1 8:2 decreased, and the GLA increased. A 9,12 
18:2 levels are increased in most of the FTs as well. Note that these are Fl 
seeds and only contain one set of each desaturase. In future generations and 
selection of events homoiygous for each desaturase, the F2 GLA levels 

1 5 obtained may be even higher. 

Combining traits by crossing may be preferable to combining traits on 
. one T-DN A in some situations. Particularly if both genes are driven off of the 
same promoter (in this case napin), issues of promoter silencing may favor this 
approach over putting nultiple cDN As on one construct. 

20 Alternatively, in some cases, combining multiple cDNAs on one T-DNA 

may be the method of choice. The results are shown in Table 17. 
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Example 16 

Expression of M. alvinz desaturases in soybean 

The M. alpina desaturases can be used to drive production of GLA and 
other PUFAs in soybean by use of the following expression constructs. Two 
means by which exogenous DNA can be inserted into the soybean genome are 
Agrobacterium infection or particle gun. Particle gun transformation is 
disclosed in U.S. patent 5,503,998. Plants can be selected using a glyphosate 
resistance marker (4, 971, 908). Agrobacterium transformation of soybean is 
well established to one of ordinary skill in the art. 

For seed specific expression, the coding regions of the desaturase 
cDNAs are placed under control of the 5' regulatory region of Glycine max 
alpha-type beta conglycinin storage protein gene. The specific region that can 
be used is nucleotides 78-921 of gi 169928 (Doyle, J.J., Schuler, M.A., ' 
Godette, W.D., Zenger, V., Beachy, R.N., and Slightom. J.L., 1986 J. Biol. 
Chem. 261 (20), 9228-9238). The 3' regulatory region that can be used is from 
the pea ribulose 1,5 bisphosphate carboxylase/oxygenase small subunit (rbcS) 
- gene. The specific sequences to be used are nucleotides 1-645 of gi 169145 
(Hunt, A.G. 1988 DNA 7: 329-336). 

Since soybean seeds contain more 1 8:2, and perhaps more endogenous 
A12 desaturase activity than Brassica, the effect of the Mortierella A12 
desaturase on achieving optimal GLA levels can be tested as follows. A 
construct containing the A6 cDNA can be used to see if A 6,9 18:2 is produced 
along with GLA. A construct containing the A 12 desaturase can be used to see 
if the amount of 1 8:2 can be increased in soybean. A construct containing both 
the A6 and A12 desaturases can be used to produce optimal levels of GLA. 
Alternatively, plants containing each of the single desaturases may be crossed if 
necessary to combine the genes. 

Similar constructs may be made to express the A5 desaturase alone, or in 
combination with A12 and/or A6 desaturases. 
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Example 17 

Human Desaturase Gene Sequences 

Human desaturase gene sequences potentially involved in long chain 
polyunsaturated fatty acid biosynthesis were isolated based on homology 

5 between the human cDNA sequences and Mortierella alpina desaturase gene 

sequences. The three conserved "histidine boxes" known to be conserved 
among membrane-bound desaturases were found. As with some other 
membrane-bound desaturases the final HXXHH histidine box motif was found 
to be QXXHH. The amino acid sequence of the putative human desaturases 

1 0 exhibited homology to M alpina A5, A6, A9, and A 12 desaturases. 

The M. alpina A5 desaturase and A6 desaturase cDNA sequences were 
used to search the LifeSeq database of Incyte Pharmaceuticals, Inc., Palo Alto, 
California 94304. The A5 desaturase sequence was divided into fragments; 1) 
amino acid no. 1-150, 2) amino acid no. 151-300, and 3) amino acid no. 301- 

15 446. The A6 desaturase sequence was divided into three fragments; 1) amino 

acid no. 1-150, 2) amino acid no. 151-300, and 3) amino acid no. 301-457. 
These polypeptide fragments were searched against the database using the 
"tblastn" algorithm. This alogarithm compares a protein query sequence against 
a nucleotide sequence database dynamically translated in all six reading frames 

20 (both strands). 

The polypeptide fragments 2 and 3 of M alpina A5 and A6 have 
homologies with the ClonelD sequences as outlined in Table 18. The ClonelD 
represents an individual sequence from the Incyte LifeSeq database. After the 
"tblastn" results have been reviewed, Clone Information was searched with the 

25 default settings of Stringency of >=50, and Productscore <=1 00 for different 

ClonelD numbers. The Clone Information Results displayed the information 
including the ClusterlD, ClonelD, Library, HitID, Hit Description. When 
selected, the ClusterlD number displayed the clone information of all the clones 
that belong in that ClusterlD. The Assemble command assembles all of the 

30 ClonelD which comprise the ClusterlD. The following default settings were 
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used for GCG (Genetics Computer Group, University of Wisconsin 
Biotechnology Center, Madison, Wisconsin 53705) Assembly: 



Word Size: 7 

5 Minimum Overlap: 14 

Stringency: 0.8 

Minimum Identity: 14 

Maximum Gap: 10 

Gap Weight: 8 

10 Length Weight: 2 



GCG Assembly Results displayed the contigs generated on the basis of 
sequence information within the ClonelD. A contig is an alignment of DNA 
sequences based on areas of homology among these sequences. A new 
sequence (consensus sequence) was generated based on the aligned DNA 
sequences within a contig. The contig containing the ClonelD was identified, 
and the ambiguous sites of the consensus sequence was edited based on the 
alignment of the ClonelDs (see SEQ ID NO:3 1 - SEQ ID NO:35) to generate 
the best possible sequence. The procedure was repeated for all six ClonelD 
listed in Table 18. This produced five unique contigs. The edited consensus 
sequences of the 5 contigs were imported into the Sequencher software program 
(Gene Codes Corporation, Ann Arbor, Michigan 48 105). These consensus 
sequences were assembled. The contig 251 1785 overlaps with contig 3506132, 
and this new contig was called 2535 (SEQ ID NO:37). The contigs from the 
Sequencher program were copied into the Sequence Analysis software package 
of GCG. 

Each contig was translated in all six reading frames into protein 
sequences. The M. alpina A5 (MA29) and A6 (MA524) sequences were 
compared with each of the translated contigs using the FastA search (a Pearson 
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and Lipman search for similarity between a query sequence and a group of 
sequences of the same type (nucleic acid or protein)). Homology among these 
sequences suggest the open reading frames of each contig. The homology 
among the M alpina A5 and A6 to contigs 2535 and 3854933 were utilized to 
5 create the final contig called 253538a. Figure 9 is the FastA match of the final 

contig 253538a and MA29, and Figure 10 is the FastA match of the final contig 
253538a and MA524. The DNA sequences for the various contigs are 
presented in SEQ ID NO:31 -SEQ ID NO:37 The various peptide sequences 
are shown in SEQ ID NO:38 - SEQ ID NO: 44. 

1 0 Although the open reading frame was generated by merging the two 

contigs, the contig 2535 shows that there is a unique sequence in the beginning 
of this contig which does not match with the contig 3854933. Therefore, it is 
possible that these contigs were generated from independent desaturase like 
human genes. 

15 The contig 253538a contains an open reading frame encoding 432 

amino acids. It starts with Gin (CAG) and ends with the stop codon (TGA). 
The contig 253538a aligns with both M. alpina A5 and A6 sequences, 
suggesting that it could be either of the desaturases, as well as other known 
desaturases which share homology with each other. The individual contigs 

20 listed in Table 18, as well as the intermediate contig 2535 and the final contig 

253538a can be utilized to isolate the complete genes for human desaturases. 

Uses of the Human Desaturases 

These human sequences can be expressed in yeast and plants utilizing 
the procedures described in the preceding examples. For expression in 
25 mammalian cells and transgenic animals, these genes may provide superior 

codon bias. In addition, these sequences can be used to isolate related 
desaturase genes from other organisms. 
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Table 18 



Sections of the 
Desaturases 


Clone ID from LifeSeq Database 


Keyword 


151-300 A5 


3808675 


fatty acid desaturase 


301-446 A5 


354535 


A6 


151-300 A6 


[ 3448789 


A6 


151-300 A6 


1362863 


A6 


151-300 A6 


2394760 


A6 


301-457 A6 


3350263 


A6 



Example 18 

Identification o f Homologues to M. alpina A5 and A6 desaturases 
A nucleic acid sequence that encodes a putative A5 desaturase was 
identified through a TBLASTN search of the expressed sequence tag databases 
through NCBI using amino acids 100-446 of M_a29 as a query. The truncated 
portion of the Ma29 sequence was used to avoid picking up homologies based 
on the cytochrome b5 portion at the N-terminus of the desaturase. The deduced 
amino acid sequence of an est from Dictyostelium discoideum (accession # 
C25549) shows very significant homology to Ma29 and lesser, but still 
significant homology to Ma524. The DNA sequence is presented as SEQ ID 
NO:45. The amino acid sequence is presented as SEQ ID NO:46. 

Example 19 

Identificat ion of M. alpina A5 and A6 homologues in other 
PUFA-produci pg organisms 

To look for desaturases involved in PUFA production, a cDNA library 

was constructed from total RNA isolated from Phaeodactylum tricornutum. A 

plasmid-based cDNA library was constructed in pSPORTl (GIBCO-BRL) 

following manufacturer's instructions using a commercially available kit 

(GIBCO-BRL). Random cDNA clones were sequenced and nucleic acid 

sequences that encode putative A5 or A6 desaturases were identified through 

BLAST search of the databases and comparison to Ma29 and Ma524 sequences. 



-105- 



WO 98/46764 




PCT/US98/07421 



One clone was identified from the Phaeodactylum library with 
homology to Ma29 and Ma524; it is called 144-01 1-B12. The DNA sequence is 
presented as SEQ ID NO:47. The amino acid sequence is presented as SEQ ID 
NO:48. 

5 Example 20 

Identification of M. alvina A5 and A6 homologues in other 
PUFA-producing organisms 

To look for desaturases involved in PUFA production, a cDNA library 

was constructed from total RNA isolated from Schizochytrium species. A 

10 plasmid-based cDNA library was constructed in pSPORTl (GIBCO-BRL) 

following manufacturer's instructions using a commercially available kit 

(GIBCO-BRL). Random cDNA clones were sequenced and nucleic acid 

sequences that encode putative A5 or A6 desaturases were identified through 

BLAST search of the databases and comparison to Ma29 and Ma524 sequences. 

15 One clone was identified from the Schizochytrium library with 

homology to Ma29 and Ma524; it is called 81 -23-C7. This clone contains a ~1 
kb insert. Partial sequence was obtained from each end of the clone using the 
universal forward and reverse sequencing primers. The DNA sequence from 
the forward primer is presented as SEQ ID NO:49. The peptide sequence is 

20 presented as SEQ ID NO:50. The DNA sequence from the reverse primer is 

presented as SEQ ID NO:51. The amino acid sequence from the reverse primer 
is presented as SEQ ID NO:52. 

Example 21 

Nutritional Compositions 

25 The PUFAs of the previous examples can be utilized in various 

nutritional supplements, infant formulations, nutritional substitutes and other 
nutrition solutions. 

I. INFANT FORMULATIONS 

A. Isomil® Soy Formula with Iron. 
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Usage: As a beverage for infants, children and adults with an allergy or 
sensitivity to cow's milk. A feeding for patients with disorders for which 
lactose should be avoided: lactase deficiency, lactose intolerance and 
galactosemia. 

5 Features: 

• Soy protein isolate to avoid symptoms of cow's-milk-protein 
allergy or sensitivity 

• Lactose-free formulation to avoid lactose-associated diarrhea 

• Low osmolaity (240 mOsm/kg water) to reduce risk of osmotic 
10 diarrhea. 

• Dual carbohydrates (com syrup and sucrose) designed to 
enhance carbohydrate absorption and reduce the risk of exceeding the 
absorptive capacity of the damaged gut. 

• 1 .8 mg of Iron (as ferrous sulfate) per 1 00 Calories to help 
1 5 prevent iron deficiency. 

• Recommended levels of vitamins and minerals. 

• Vegetable oils to provide recommended levels of essential fatty 
acids. 

• Milk-white color, milk-like consistency and pleasant aroma. 

20 Ingredients: (Pareve, ©) 85% water, 4.9% com syrup, 2.6% sugar 

(sucrose), 2.1% soy oil, 1.9% soy protein isolate, 1.4% coconut oil, 0.15% 
calcium citrate, 0.1 1 % calcium phosphate tribasic, potassium citrate, potassium 
phosphate monobasic, potassium chloride, mono- and disglycerides, soy 
lecithin, carrageenan, ascorbic acid, L-methionine, magnesium chloride, 

25 potassium phosphate dibasic, sodium chloride, choline chloride, taurine, ferrous 

sulfate, m-inositol, alpha-tocopheryl acetate, zinc sulfate, L-carnitine, 
niacinamide, calcium pantothenate, cupric sulfate, vitamin A palmitate, 
thiamine chloride hydrochloride, riboflavin, pyridoxine hydrochloride, folic 
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acid, manganese sulfate, potassium iodide, phylloquinone, biotin, sodium 
selenite, vitamin D3 and cyanocobalamin 

B. Isomil® DF Soy Formula For Diarrhea. 

Usage: As a short-term feeding for the dietary management of diarrhea 
5 in infants and toddlers. 

Features: 

• First infant formula to contain added dietary fiber from soy fiber 
specifically for diarrhea management. 

• Clinically shown to reduce the duration of loose, watery stools 
10 during mild to severe diarrhea in infants. 

• Nutritionally complete to meet the nutritional needs of the infant. 

• Soy protein isolate with added L-methionine meets or exceeds an 
infant's requirement for all essential amino acids. 

• Lactose-free formulation to avoid lactose-associated diarrhea. 

1 5 • Low osmolality (240 mOsm/kg water) to reduce the risk of 

osmotic diarrhea. 

• Dual carbohydrates (corn syrup and sucrose) designed to 
enhance carbohydrate absorption and reduce the risk of exceeding the 
absorptive capacity of the damaged gut. 

20 • Meets or exceeds the vitamin and mineral levels recommended 

by the Committee on Nutrition of the American Academy of Pediatrics 
and required by the Infant Formula Act. 

• 1 .8 mg of iron (as ferrous sulfate) per 1 00 Calories to help 
prevent iron deficiency. 

25 • Vegetable oils to provide recommended levels of essential fatty 

acids. 

Ingredients: (Pareve, ©) 86% water, 4.8% com syrup, 2.5% sugar 
(sucrose), 2.1% soy oil, 2.0% soy protein isolate, 1.4% coconut oil, 0.77% soy 

-108- 



WO 98/46764 




PCT/US98/07421 



fiber, 0.12% calcium citrate, 0.1 1 % calcium phosphate tribasic, 0.10% 
potassium citrate, potassium chloride, potassium phosphate monobasic, mono- 
and disglycerides, soy lecithin, carrageenan, magnesium chloride, ascorbic acid, 
L-methionine, potassium phosphate dibasic, sodium chloride, choline chloride, 
5 taurine, ferrous sulfate, m-inositol, alpha-tocopheryl acetate, zinc sulfate, L- 

carnitine, niacinamide, calcium pantothenate, cupric sulfate, vitamin A 
palmitate, thiamine chloride hydrochloride, riboflavin, pyridoxine 
hydrochloride, folic acid, manganese sulfate, potassium iodide, phylloquinone, 
biotin, sodium selenite, vitamin D 3 and cyanocobalamin 

1 0 C. Isomil® SF Sucrose-Free Soy Formula With Iron. 

Usage: As a beverage for infants, children and adults with an allergy or 
sensitivity to cow's-milk protein or an intolerance to sucrose. A feeding for 
patients with disorders for which lactose and sucrose should be avoided. 

Features: 

1 5 • Soy protein isolate to avoid symptoms of cow's-milk-protein 

allergy or sensitivity. 

• Lactose-free formulation to avoid lactose-associated diarrhea 
(carbohydrate source is Polycose® Glucose Polymers). 

• Sucrose free for the patient who cannot tolerate sucrose. 

20 # Low osmolality (1 80 mOsm/kg water) to reduce risk of osmotic 

diarrhea. 

• 1 .8 mg of iron (as ferrous sulfate) per 1 00 Calories to help 
prevent iron deficiency. 

• Recommended levels of vitamins and minerals. 

25 # Vegetable oils to provide recommended levels of essential fatty 

acids. 

• Milk-white color, milk-like consistency and pleasant aroma. 

Ingredients: (Pareve, ©) 75% water, 1 1.8% hydrolized cornstarch, 4.1% 
soy oil, 4.1% soy protein isolate, 2.8% coconut oil, 1.0% modified cornstarch, 

-109- 



WO 98/46764 




PCT/US98/07421 



0.38% calcium phosphate tribasic, 0.17% potassium citrate, 0.13% potassium 
chloride, mono- and disglycerides, soy lecithin, magnesium chloride, abscorbic 
acid, L-methionine, calcium carbonate, sodium chloride, choline chloride, 
carrageenan, taurine, ferrous sulfate, m-inositol, alpha-tocopheryl acetate, zinc 
5 sulfate, L-carnitine, niacinamide, calcium pantothenate, cupric sulfate, vitamin 

A palmitate, thiamine chloride hydrochloride, riboflavin, pyridoxine 
hydrochloride, folic acid, manganese sulfate, potassium iodide, phylloquinone, 
biotin, sodium selenite, vitamin D3 and cyanocobalamin 

D. Isomil® 20 Soy Formula With Iron Ready To Feed, 
10 20Cal/floz. 

Usage: When a soy feeding is desired. 

Ingredients: (Pareve, ®) 85% water, 4.9% corn syrup, 2.6% sugar 
(sucrose), 2.1% soy oil, 1.9% soy protein isolate, 1.4% coconut oil, 0.15% 
calcium citrate, 0.1 1% calcium phosphate tribasic, potassium citrate, potassium 

15 phosphate monobasic, potassium chloride, mono- and disglycerides, soy 

lecithin, carrageenan, abscorbic acid, L-methionine, magnesium chloride, 
potassium phosphate dibasic, sodium chloride, choline chloride, taurine, ferrous 
sulfate, m-inositol, alpha-tocopheryl acetate, zinc sulfate, L-carnitine, 
niacinamide, calcium pantothenate, cupric sulfate, vitamin A palmitate, 

20 thiamine chloride hydrochloride, riboflavin, pyridoxine hydrochloride, folic 

acid, manganese sulfate, potassium iodide, phylloquinone, biotin, sodium 
selenite, vitamin D3 and cyanocobalamin. 

E. Similac® Infant Formula 

Usage: When an infant formula is needed: if the decision is made to 
25 discontinue breastfeeding before age 1 year, if a supplement to breastfeeding is 

needed or as a routine feeding if breastfeeding is not adopted. _ 
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Features: 

• Protein of appropriate quality and quantity for good growth; 
heat-denatured, which reduces the risk of milk-associated enteric blood 
loss. 

5 • Fat from a blend of vegetable oils (doubly homogenized), 

providing essential linoleic acid that is easily absorbed. 

• Carbohydrate as lactose in proportion similar to that of human 
milk. 

• Low renal solute load to minimize stress on developing organs. 

1 0 • Powder, Concentrated Liquid and Ready To Feed forms. 

Ingredients: (®-D) Water, nonfat milk, lactose, soy oil, coconut oil, 
mono- and diglycerides, soy lecithin, abscorbic acid, carrageenan, choline 
chloride, taurine, m-inositol, alpha-tocopheryl acetate, zinc sulfate, niacinamid, 
ferrous sulfate, calcium pantothenate, cupric sulfate, vitamin A palmitate, 
1 5 thiamine chloride hydrochloride, riboflavin, pyridoxine hydrochloride, folic 

acid, manganese sulfate, phylloquinone, biotin, sodium selenite, vitamin D 3 and 
cyanocobalamin 

F. Similac® NeoCare Premature Infant Formula With Iron 

Usage: For premature infants 1 special nutritional needs after hospital 
20 discharge. Similac NeoCare is a nutritionally complete formula developed to 

provide premature infants with extra calories, protein, vitamins and minerals 
needed to promote catch-up growth and support development. 

Features: 

• Reduces the need for caloric and vitamin supplementation. More 
25 calories (22 Cal/fl oz) then standard term formulas (20 Cal/fl oz). 

• Highly absorbed fat blend, with medium-chain triglycerides 
(MCT oil) to help meet the special digestive needs of premature infants. 

• Higher levels of protein, vitamins and minerals per 100 Calories 
to extend the nutritional support initiated in-hospital. 
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• More calcium and phosphorus for improved bone mineralization. 

Ingredients: ®-D Com syrup solids, nonfat milk, lactose, whey protein 
concentrate, soy oil, high-oleic safflower oil, fractionated coconut oil (medium- 
chain triglycerides), coconut oil, potassium citrate, calcium phosphate tribasic, 

5 calcium carbonate, ascorbic acid, magnesium chloride, potassium chloride, 

sodium chloride, taurine, ferrous sulfate, m-inositol, choline chloride, ascorbyl 
palmitate, L-carnitine, alpha-tocopheryl acetate, zinc sulfate, niacinamide, 
mixed tocopherols, sodium citrate, calcium pantothenate, cupric sulfate, 
thiamine chloride hydrochloride, vitamin A palmitate, beta carotene, riboflavin, 

10 pyridoxine hydrochloride, folic acid, manganese sulfate, phylloquinone, biotin, 

sodium selenite, vitamin D3 and cyanocobalamin. 

G. Similac Natural Care Low-Iron Human Milk Fortifier Ready 
To Use, 24 Cal/fl oz. 

Usage: Designed to be mixed with human milk or to be fed alternatively 
1 5 with human milk to low-birth-weight infants. 

Ingredients: ®_-D Water, nonfat milk, hydrolyzed cornstarch, lactose, 
fractionated coconut oil (medium-chain triglycerides), whey protein 
concentrate, soil oil, coconut oil, calcium phosphate tribasic, potassium citrate, 
magnesium chloride, sodium citrate, ascorbic acid, calcium carbonate, mono- 

20 and diglycerides, soy lecithin, carrageenan, choline chloride, m-inositol, taurine, 

niacinamide, L-carnitine, alpha tocopheryl acetate, zinc sulfate, potassium 
chloride, calcium pantothenate, ferrous sulfate, cupric sulfate, riboflavin, 
vitamin A palmitate, thiamine chloride hydrochloride, pyridoxine 
hydrochloride, biotin, folic acid, manganese sulfate, phylloquinone, vitamin D 3) 

25 sodium selenite and cyanocobalamin. 

Various PUFAs of this invention can be substituted and/or added to the 
infant formulae described above and to other infant formulae known to those in 
the art.. 
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II. NUTRITIONAL FORMULATIONS 

A. ENSURE® 

Usage: ENSURE is a low-residue liquid food designed primarily as an 
oral nutritional supplement to be used with or between meals or, in appropriate 
5 amounts, as a meal replacement. ENSURE is lactose- and gluten-free, and is 

suitable for use in modified diets, including low-cholesterol diets. Although it 
is primarily an oral supplement, it can be fed by tube. 

Patient Conditions: 

• For patients on modified diets 

1 0 • For elderly patients at nutrition risk 

• For patients with involuntary weight loss 

• For patients recovering from illness or surgery 

• For patients who need a low-residue diet 
Ingredients: 

1 5 ®-D Water, Sugar (Sucrose), Maltodextrin (Corn), Calcium and Sodium 

Caseinates, High-Oleic Safflower Oil, Soy Protein Isolate, Soy Oil, Canola Oil, 
Potassium Citrate, Calcium Phosphate Tribasic, Sodium Citrate, Magnesium 
Chloride, Magnesium Phosphate Dibasic, Artificial Flavor, Sodium Chloride, 
Soy Lecithin, Choline Chloride, Ascorbic Acid, Carrageenan, Zinc Sulfate, 

20 Ferrous Sulfate, Alpha-Tocopheryl Acetate, Gellan Gum, Niacinamide, 

Calcium Pantothenate, Manganese Sulfate, Cupric Sulfate, Vitamin A 
Palmitate, Thiamine Chloride Hydrochloride, Pyridoxine Hydrochloride, 
Riboflavin, Folic Acid, Sodium Molybdate, Chromium Chloride, Biotin, 
Potassium Iodide, Sodium Selenate. 

25 

B. ENSURE® BARS 

Usage: ENSURE BARS are complete, balanced nutrition for 
supplemental use between or with meals. They provide a delicious, nutrient- 
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rich alternative to other snacks. ENSURE BARS contain <1 g lactose/bar, and 
Chocolate Fudge Brownie flavor is gluten-free. (Honey Graham Crunch flavor 
contains gluten.) 

Patient Conditions: 

5 • For patients who need extra calories, protein, vitamins and minerals 

• Especially useful for people who do not take in enough calories and 
nutrients 

• For people who have the ability to chew and swallow 

• Not to be used by anyone with a peanut allergy or any type of allergy to 
10 nuts. 

Ingredients: 

Honey Graham Crunch ~ High-Fructose Corn Syrup, Soy Protein: 
Isolate, Brown Sugar, Honey, Maltodextrin (Corn), Crisp Rice (Milled Rice, 
Sugar [Sucrose], Salt [Sodium Chloride] and Malt), Oat Bran, Partially 
15 Hydrogenated Cottonseed and Soy Oils, Soy Polysaccharide, Glycerine, Whey 

Protein Concentrate, Polydextrose, Fructose, Calcium Caseinate, Cocoa 
Powder, Artificial Flafors, Canola Oil, High-Oleic Safflower Oil, Nonfat Dry 
Milk, Whey Powder, Soy Lecithin and Corn Oil. Manufactured in a facility that 
processes nuts. 

20 Vitamins and Minerals: 

Calcium Phosphate Tribasic, Potassium Phosphate Dibasic, Magnesium 
Oxide, Salt (Sodium Chloride), Potassium Chloride, Ascorbic Acid, Ferric 
Orthophosphate, Alpha-Tocopheryl Acetate, Niacinamide, Zinc Oxide, Calcium 
Pantothenate, Copper Gluconate, Manganese Sulfate, Riboflavin, Beta- 
25 Carotene, Pyridoxine Hydrochloride, Thiamine Mononitrate, Folic Acid, Biotin, 

Chromium Chloride, Potassium Iodide, Sodium Selenate, Sodium Molybdate, 
Phylloquinone, Vitamin D3 and Cyanocobalamin. 
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Protein: 

Honey Graham Crunch - The protein source is a blend of soy protein isolate 
and milk proteins. 

Soy protein isolate 74% 
5 Milk proteins 26% 

Fat: 

Honey Graham Crunch - The fat source is a blend of partially 
hydrogenated cottonseed and soybean, canola, high oleic safflower, and corn 
oils, and soy lecithin. 

0 Partially hydrogenated cottonseed and soybean oil 76% 



Canola oil 8% 

High-oleic safflower oil 8% 

Corn oil 40^ 

Soy lecithin 4% 

15 Carbohydrate: 



Honey Graham Crunch - The carbohydrate source is a combination of 
high-fructose corn syrup, brown sugar, maltodextrin, honey, crisp rice, 
glycerine, soy polysaccharide, and oat bran. 



High-fructose corn syrup 24% 

20 Brown sugar 2 1 % 

Maltodextrin 12% 

Honey ip/ 0 

Crisp rice 9% 

Glycerine 9% 

25 Soy polysaccharide 704 

Oat bran 7%\ 
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C. ENSURE® HIGH PROTEIN 

Usage: ENSURE HIGH PROTEIN is a concentrated, high-protein 
liquid food designed for people who require additional calories, protein, 
vitamins, and minerals in their diets. It can be used as an oral nutritional 
5 supplement with or between meals or, in appropriate amounts, as a meal 

replacement. ENSURE HIGH PROTEIN is lactose- and gluten-free, and is 
suitable for use by people recovering from general surgery or hip fractures and 
by patients at risk for pressure ulcers. 

Patient Conditions 

10 • For patients who require additional calories, protein, vitamins, and minerals, 

such as patients recovering from general surgery or hip fractures, patients at risk 
for pressure ulcers, and patients on low-cholesterol diets 

Features- 

• Low in saturated fat 

15 • Contains 6 g of total fat and < 5 mg of cholesterol per serving 

• Rich, creamy taste 

• Excellent source of protein, calcium, and other essential vitamins and 
minerals 

• For low-cholesterol diets 
20 • Lactose-free, easily digested 

Ingredients: 

Vanilla Supreme: -®-D Water, Sugar (Sucrose), Maltodextrin (Corn), Calcium 
and Sodium Caseinates, High-Oleic Safflower Oil, Soy Protein Isolate, Soy Oil, 
Canola Oil, Potassium Citrate, Calcium Phosphate Tribasic, Sodium Citrate, 
25 Magnesium Chloride, Magnesium Phosphate Dibasic, Artificial Flavor, Sodium 

Chloride, Soy Lecithin, Choline Chloride, Ascorbic Acid, Carrageenan, Zinc 
Sulfate, Ferrous Suffate, Alpha-Tocopheryl Acetate, Gellan Gum, Niacinamide, 
Calcium Pantothenate, Manganese Sulfate, Cupric Sulfate, Vitamin A 
Palmitate, Thiamine Chloride Hydrochloride, Pyridoxine Hydrochloride, 

-116- 



WO 98/46764 




PCT/US98/07421 



Riboflavin, Folio Acid, Sodium Motybdate, Chromium Chloride, Biotin, 
Potassium Iodide, Sodium Selenate, Phylloquinone, Vitamin D.3 and 
Cyanocobalarnin. 

Protein: 

5 The protein source is a blend of two high-biologic-value proteins: casein and 

soy. 

Sodium and calcium caseinates 85% 
Soy protein isolate 1 5% 

Fat: 

10 The fat source is a blend of three oils: high-oleic safflower, canola, and soy. 

High-oleic safflower oil 40% 

Canola oil 30% 

Soy oil 30% 

The level of fat in ENSURE HIGH PROTEIN meets American Heart 
1 5 Association (AHA) guidelines. The 6 grams of fat in ENSURE HIGH 

PROTEIN represent 24% of the total calories, with 2.6% of the fat being from 
saturated fatty acids and 7.9% from polyunsaturated fatty acids. These values 
are within the AHA guidelines of < 30% of total calories from fat, < 1 0% of the 
calories from saturated fatty acids, and < 1 0% of total calories from 
20 polyunsaturated fatty acids. 

Carbohydrate: 

ENSURE HIGH PROTEIN contains a combination of maltodextrin and 
sucrose. The mild sweetness and flavor variety (vanilla supreme, chocolate 
royal, wild berry, and banana), plus VARI-FLAVORSO® Flavor Pacs in pecan, 
25 cherry, strawberry, lemon, and orange, help to prevent flavor fatigue and aid in 

patient compliance. 

Vanilla and other nonchocolate flavors 

Sucrose 60% 
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Maltodextrin 40% 
Chocolate 

Sucrose 70% 
Maltodextrin 30% 

5 

D. ENSURE ® LIGHT 

Usage: ENSURE LIGHT is a low-fat liquid food designed for use as an 
oral nutritional supplement with or between meals. ENSURE LIGHT is 
lactose- and gluten-free, and is suitable for use in modified diets, including low- 
10 cholesterol diets. 

Patient Conditions: 

• For normal-weight or overweight patients who need extra nutrition in a 
supplement that contains 50% less fat and 20% fewer calories than ENSURE 

• For healthy adults who don't eat right and need extra nutrition 
15 Features: 

• Low in fat and saturated fat 

• Contains 3 g of total fat per serving and < 5 mg cholesterol 

• Rich, creamy taste 

• Excellent source of calcium and other essential vitamins and minerals 
20 • For low-cholesterol diets 

• Lactose-free, easily digested 
Ingredients: 

French Vanilla: ©-D Water, Maltodextrin (Corn), Sugar (Sucrose), Calcium 
Caseinate, High-Oleic Safflower Oil, Canola Oil, Magnesium Chloride, Sodium 
25 Citrate, Potassium Citrate, Potassium Phosphate Dibasic, Magnesium Phosphate 

Dibasic, Natural and Artificial Flavor, Calcium Phosphate Tribasic, Cellulose 
Gel, . Choline Chloride, Soy Lecithin, Carrageenan, Salt (Sodium Chloride), 

-118- 



WO 98/46764 




PC17US98/07421 



Ascorbic Acid, Cellulose Gum, Ferrous Sulfate, Alpha-Tocopheryl Acetate, 
Zinc Sulfate, Niacinamide, Manganese Sulfate, Calcium Pantothenate, Cupric 
Sulfate, Thiamine Chloride Hydrochloride, Vitamin A Palmitate, Pyridoxine 
Hydrochloride, Riboflavin, Chromium Chloride, Folic Acid, Sodium 
5 Molybdate, Biotin, Potassium Iodide, Sodium Selenate, Phylloquinone, Vitamin 

D3 and Cyanocobalamin. 

Protein: 

The protein source is calcium caseinate. 

Calcium caseinate 100% 

10 Fat 

The fat source is a blend of two oils: high-oleic safflower and canola. 

High-oleic safflower oil 70% 

Canola oil 30% 

The level of fat in ENSURE LIGHT meets American Heart Association 
1 5 (AHA) guidelines. The 3 grams of fat in ENSURE LIGHT represent 1 3 .5% of 

the total calories, with 1.4% of the fat being from saturated fatty acids and 2.6% 
from polyunsaturated fatty acids. These values are within the AHA guidelines 
of < 30% of total calories from fat, < 1 0% of the calories from saturated fatty 
acids, and < 1 0% of total calories from polyunsaturated fatty acids. 

20 Carbohydrate 

ENSURE LIGHT contains a combination of maltodextrin and sucrose. 
The chocolate flavor contains com syrup as well. The mild sweetness and 
flavor variety (French vanilla, chocolate supreme, strawberry swirl), plus 
VARI-FLAVORS® Flavor Pacs in pecan, cherry, strawberry, lemon, and 
25 orange, help to prevent flavor fatigue and aid in patient compliance. 

Vanilla and other nonchocolate flavors 

Sucrose 51% 
Maltodextrin 49% 
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Chocolate 

Sucrose 47.0% 

Corn Syrup 26.5% 

Maltodextrin 26,5% 

5 Vitamins and Minerals 

An 8-fl-oz serving of ENSURE LIGHT provides at least 25% of the 
RDIs for 24 key vitamins and minerals. 

Caffeine 

Chocolate flavor contains 2.1 mg caffeine/8 fl oz. 

10 

E. ENSURE PLUS® 

Usage: ENSURE PLUS is a high-calorie, low-residue liquid food for 
use when extra calories and nutrients, but a normal concentration of protein, are 
needed. It is designed primarily as an oral nutritional supplement to be used 
1 5 with or between meals or, in appropriate amounts, as a meal replacement. 

ENSURE PLUS is lactose- and gluten-free. Although it is primarily an oral 
nutritional supplement, it can be fed by tube. 

Patient Conditions: 

• For patients who require extra calories and nutrients, but a normal 
20 concentration of protein, in a limited volume 

• For patients who need to gain or maintain healthy weight 
Features 

• Rich, creamy taste 

• Good source of essential vitamins and minerals 

25 Ingredients 

Vanilla: ®-D Water, Corn Syrup, Maltodextrin (Com), Corn Oil, Sodium and 
Calcium Casemates, Sugar (Sucrose), Soy Protein Isolate, Magnesium Chloride, 
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Potassium Citrate, Calcium Phosphate Tribasic, Soy Lecithin, Natural and 
Artificial Flavor, Sodium Citrate, Potassium Chloride, Choline Chloride, 
Ascorbic Acid, Carrageenan, Zinc Sulfate, Ferrous Sulfate, Alpha-Tocopheryl 
Acetate, Niacinamide, Calcium Pantothenate, Manganese Sulfate, Cupric 
5 Sulfate, Thiamine Chloride Hydrochloride, Pyridoxine Hydrochloride, 

Riboflavin, Vitamin A Palmitate, Folic Acid, Biotin, Chromium Chloride, 
Sodium Molybdate, Potassium Iodide, Sodium Selenite, Phylloquinone, 
Cyanocobalamin and Vitamin D 3 . 

Protein 

10 The protein source is a blend of two high-biologic-value proteins: casein 

and soy. 

Sodium and calcium caseinates 84% 
Soy protein isolate 16% 

Fat 

1 5 The fat source is com oil. 

Com oil 100% 
Carbohydrate 

ENSURE PLUS contains a combination of maltodextrin and sucrose. 
The mild sweetness and flavor variety (vanilla, chocolate, strawberry, coffee, 
20 buffer pecan, and eggnog), plus VARI-FLAVORS® Flavor Pacs in pecan, 

cherry, strawberry, lemon, and orange, help to prevent flavor fatigue and aid in 
patient compliance. 

Vanilla, strawberry, butter pecan, and coffee flavors 



Com Syrup 29% 

25 Maltodextrin 3g% 

Sucrose 23% 
Chocolate and eggnog flavors 

Com Syrup 36 o /o 
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Maltodextrin 34% 

Sucrose 30% 

Vitamins and Minerals 

An 8-fl-oz serving of ENSURE PLUS provides at least 15% of the RDIs 
5 for 25 key Vitamins and minerals. 

Caffeine 

Chocolate flavor contains 3.1 mg CafFeine/8 fl oz. Coffee flavor 
contains a trace amount of caffeine. 



1 0 F. ENSURE PLUS® HN 

Usage: ENSURE PLUS HN is a nutritionally complete high-calorie, 
high-nitrogen liquid food designed for people with higher calorie and protein 
needs or limited volume tolerance. It may be used for oral supplementation or 
for total nutritional support by tube. ENSURE PLUS HN is lactose- and gluten- 
15 free. 

Patient Conditions: 

• For patients with increased calorie and protein needs, such as following 
surgery or injury 

• For patients with limited volume tolerance and early satiety 
20 Features 

• For supplemental or total nutrition 

• For oral or tube feeding 

• 1.5CaVmL 

• High nitrogen 

25 • Calorically dense 
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Ingredients 

Vanilla: ®-D Water, Maltodextrin (Corn), Sodium and Calcium Casemates, 
Corn Oil, Sugar (Sucrose), Soy Protein Isolate, Magnesium Chloride, Potassium 
Citrate, Calcium Phosphate Tribasic, Soy Lecithin, Natural and Artificial 
Flavor, Sodium Citrate, Choline Chloride, Ascorbic Acid, Taurine, L-Carnitine, 
Zinc Sulfate, Ferrous Sulfate, Alpha-Tocopheryl Acetate, Niacinamide, 
Carrageenan, Calcium Pantothenate, Manganese Sulfate, Cupric Sulfate, 
Thiamine Chloride Hydrochloride, Pyridoxine Hydrochloride, Riboflavin, 
Vitamin A Palmitate, Folic Acid, Biotin, Chromium Chloride, Sodium 
Molybdate, Potassium Iodide, Sodium Selenite, Phylloquinone, 
Cyanocobalamin and Vitamin D3. 

G. ENSURE® POWDER 

Usage: ENSURE POWDER (reconstituted with water) is a low-residue 
liquid food designed primarily as an oral nutritional supplement to be used with 
or between meals. ENSURE POWDER is lactose- and gluten-free, and is 
suitable for use in modified diets, including low-cholesterol diets. 
Patient Conditions: 

• For patients on modified diets 

• For elderly patients at nutrition risk 

• For patients recovering from illness/surgery 

• For patients who need a low-residue diet 
Features 

• Convenient, easy to mix 

• Low in saturated fat 

• Contains 9 g of total fat and < 5 mg of cholesterol per serving 

• High in vitamins and minerals 

• For low-cholesterol diets 
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• Lactose-free, easily digested 

Ingredients: ®-D Corn Syrup, Maltodextrin (Corn), Sugar (Sucrose), Corn Oil, 
Sodium and Calcium Caseinates, Soy Protein Isolate, Artificial Flavor, 
Potassium Citrate, Magnesium Chloride, Sodium Citrate, Calcium Phosphate 

5 Tribasic, Potassium Chloride, Soy Lecithin, Ascorbic Acid, Choline Chloride, 

Zinc Sulfate, Ferrous Sulfate, Alpha-Tocopheryi Acetate, Niacinamide, 
Calcium Pantothenate, Manganese Sulfate, Thiamine Chloride Hydrochloride, 
Cupric Sulfate, Pyridoxine Hydrochloride, Riboflavin, Vitamin A Palmitate, 
Folic Acid, Biotin, Sodium Molybdate, Chromium Chloride, Potassium Iodide, 

1 0 Sodium Selenate, Phylloquinone, Vitamin D 3 and Cyanocobalamin. 

Protein 

The protein source is a blend of two high-biologic-value proteins: casein 
and soy. 

Sodium and calcium caseinates 84% 
1 5 Soy protein isolate 1 6% 

Fat 

The fat source is corn oil. 

Com oil 100% 
Carbohydrate 

20 ENSURE POWDER contains a combination of corn syrup, 

maltodextrin, and sucrose. The mild sweetness of ENSURE POWDER, plus 
VARI-FLAVORS® Flavor Pacs in pecan, cherry, strawberry, lemon, and 
orange, helps to prevent flavor fatigue and aid in patient compliance. 



Vanilla 

25 Com Syrup 35% 

Maltodextrin 35% 

Sucrose 30% 
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H. ENSURE® PUDDING 

Usage: ENSURE PUDDING is a nutrient-dense supplement providing 
balanced nutrition in a nonliquid form to be used with or between meals. It is 
appropriate for consistency-modified diets (e.g., soft, pureed, or full liquid) or 
5 for people with swallowing impairments. ENSURE PUDDING is gluten-free. 

Patient Conditions: 

• For patients on consistency-modified diets (e.g., soft, pureed, or full liquid) 

• For patients with swallowing impairments 
Features 

10 • Rich and creamy, good taste 

• Good source of essential vitamins and minerals Convenient-needs no 
refrigeration 

• Gluten-free 

Nutrient Profile per 5 oz: Calories 250, Protein 10.9%, Total Fat 34.9%, 
1 5 Carbohydrate 54.2% 

Ingredients: 

Vanilla: ®-D Nonfat Milk, Water, Sugar (Sucrose), Partially Hydrogenated 
Soybean Oil, Modified Food Starch, Magnesium Sulfate. Sodium Stearoyl 
Lactylate, Sodium Phosphate Dibasic, Artificial Flavor, Ascorbic Acid, Zinc 
20 Sulfate, Ferrous Sulfate, Alpha-Tocopheryl Acetate, Choline Chloride, 

Niacinamide, Manganese Sulfate, Calcium Pantothenate, FD&C Yellow #5, 
Potassium Citrate, Cupric Sulfate, Vitamin A Palmitate, Thiamine Chloride 
Hydrochloride, Pyridoxine Hydrochloride, Riboflavin, FD&C Yellow #6, Folic 
Acid, Biotin, Phylloquinone, Vitamin D3 and Cyanocobalamin. 

25 Protein 

The protein source is nonfat milk. 

Nonfat milk 100% 
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Fat 

The fat source is hydrogenated soybean oil. 
Hydrogenated soybean oil 1 00% 

Carbohydrate 

5 ENSURE PUDDING contains a combination of sucrose and modified 

food starch. The mild sweetness and flavor variety (vanilla, chocolate, 
butterscotch, and tapioca) help prevent flavor fatigue. The product contains 9.2 
grams of lactose per serving. 

Vanilla and other nonchocolate flavors 



10 Sucrose 56% 

Lactose 27% 

Modified food starch 1 7% 
Chocolate 

Sucrose 58% 

15 Lactose 26% 

Modified food starch 1 6% 



I. ENSURE® WITH FIBER 

Usage: ENSURE WITH FIBER is a fiber-containing, nutritionally 
complete liquid food designed for people who can benefit from increased 
dietary fiber and nutrients. ENSURE WITH FIBER is suitable for people who 
do not require a low-residue diet. It can be fed orally or by tube, and can be 
used as a nutritional supplement to a regular diet or, in appropriate amounts, as 
a meal replacement. ENSURE WITH FIBER is lactose- and gluten-free, and is 
suitable for use in modified diets, including low-cholesterol diets. 

Patient Conditions 

• For patients who can benefit from increased dietary fiber and nutrients 
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Features 

• New advanced formula-low in saturated fat, higher in vitamins and minerals 

• Contains 6 g of total fat and < 5 mg of cholesterol per serving 

• Rich, creamy taste 

5 • Good source of fiber 

• Excellent source of essential vitamins and minerals 

• For low-cholesterol diets 

• Lactose- and gluten-free 
Ingredients 

10 Vanilla: ©-D Water, Maltodextrin (Com), Sugar (Sucrose), Sodium and 
Calcium Caseinates, Oat Fiber, High-Oleic Safflower Oil, Canola Oil, Soy 
Protein Isolate, Com Oil, Soy Fiber, Calcium Phosphate Tribasic, Magnesium 
Chloride, Potassium Citrate, Cellulose Gel, Soy Lecithin, Potassium Phosphate 
Dibasic, Sodium Citrate, Natural and Artificial Flavors, Choline Chloride, 

15 Magnesium Phosphate, Ascorbic Acid, Cellulose Gum, Potassium Chloride, 

Carrageenan, Ferrous Sulfate, Alpha-Tocopheryl Acetate, Zinc Sulfate, 
Niacinamide, Manganese Sulfate, Calcium Pantothenate, Cupric Sulfate, 
Vitamin A Palmitate, Thiamine Chloride Hydrochloride, Pyridoxine 
Hydrochloride, Riboflavin, Folic Acid, Chromium Chloride, Biotin, Sodium 

20 Molybdate, Potassium Iodide, Sodium Selenate, Phylloquinone, Vitamin D 3 and 

Cyanocobalamin. 

Protein 

The protein source is a blend of two high-biologic-value proteins- casein 
and soy. 

25 Sodium and calcium caseinates 80% 

Soy protein isolate 20% 
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Fat 

The fat source is a blend of three oils: high-oleic safflower, canola, and 



corn. 

High-oleic safflower oil 40% 

5 Canola oil 40% 

Corn oil 20% 



The level of fat in ENSURE WITH FIBER meets American Heart 
Association (AHA) guidelines. The 6 grams of fat in ENSURE WITH FIBER 
represent 22% of the total calories, with 2.01 % of the fat being from saturated 
10 fatty acids and 6.7% from polyunsaturated fatty acids. These values are within 

the AHA guidelines of < 30% of total calories from fat, < 1 0% of the calories 
from saturated fatty acids, and < 1 0% of total calories from polyunsaturated 
fatty acids. 

Carbohydrate 

1 5 ENSURE WITH FIBER contains a combination of maltodextrin and 

sucrose. The mild sweetness and flavor variety (vanilla, chocolate, and butter 
pecan), plus VARI-FLAVORS® Flavor Pacs in pecan, cherry, strawberry, 
lemon, and orange, help to prevent flavor fatigue and aid in patient compliance. 

Vanilla and other nonchocolate flavors 



20 Maltodextrin 66% 

Sucrose 25% 

Oat Fiber 70/0 

Soy Fiber 2% 
Chocolate 

25 Maltodextrin 55% 

Sucrose 36% 

Oat Fiber 7% 
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Soy Fiber 2% 

Fiber 

The fiber blend used in ENSURE WITH FIBER consists of oat fiber and 
soy polysaccharide. This blend results in approximately 4 grams of total dietary 
5 fiber per 8-fl-oz can. The ratio of insoluble to soluble fiber is 95:5. 

The various nutritional supplements described above and known to 
others of skill in the art can be substituted and/or supplemented with the PUFAs 
of this invention. 

J- Oxepa™ Nutritional Product 

10 Oxepa is low-carbohydrate, calorically dense enteral nutritional product 

designed for the dietary management of patients with or at risk for ARDS. It 
has a unique combination of ingredients, including a patented oil blend 
containing eicosapentaenoic acid (EPA from fish oil), y-linolenic acid (GLA 
from borage oil), and elevated antioxidant levels. 

1 5 Caloric Distribution: 

• Caloric density is high at 1 .5 Cal/mL (355 Cal/8 fl oz), to minimize the 
volume required to meet energy needs. 



The distribution of Calories in Oxepa is shown in Table 7. 



Table 7. Caloric Distribution of Oxepa 




per 8 fl oz. 


per liter 


%ofCal 


Calories 


355 


1,500 




Fat(g) 


22.2 


93.7 


55.2 


Carbohydrate (g) 


25 


105.5 


28.1 


Protein (g) 


14.8 


62.5 


16.7 


Water (g) 


186 


785 





20 Fat: 

• Oxepa contains 22.2 g of fat per 8-fl oz serving (93 .7 g/L). 

• The fat source is a oil blend of 3 1 .8% canola oil, 25% medium-chain 
triglycerides (MCTs), 20% borage oil, 20% fish oil, and 3.2 % soy lecithin. The 
typical fatty acid profile of Oxepa is shown in Table 8. 
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• Oxepa provides a balanced amount of polyunsaturated, monounsaturated, 
and saturated fatty acids, as shown in Table 10. 

• Medium-chain trigylcerides (MCTs) 25% of the fat blend - aid gastric 
emptying because they are absorbed by the intestinal tract without 

5 emulsification by bile acids. 

The various fatty acid components of Oxepa™ nutritional product can 
be substituted and/or supplemented with the PUFAs of this invention. 



10 



Table 8. Typical Fatty Acid Profile 




% Total Fatty 
Acids 


g/8 fl oz* 


g/L* 


Caproic (6:0) 


0.2 


0.04 


0.18 


Caprylic (8:0) 


14.69 


3.1 


13.07 


Capric(10:0) 


11.06 


2.33 


9.87 


Palmitic (16:0) 


5.59 


1.18 


4.98 


Palmitoleic(16:ln-7) 


1.82 


0.38 


1.62 


Stearic (18:0) 


1.84 


0.39 


1.64 


01eic(18:ln-9) 


24.44 


5.16 


21.75 


Linoleic (18:2n-6) 


16.28 


3.44 


14.49 


a-Linolenic (18:3n-3) 


3.47 


0.73 


3.09 


y-Linolenic (18:3n-6) 


4.82 


1.02 


4.29 


Eicosapentaenoic (20:5n- 
3) 


5.11 


1.08 


4.55 


n-3-Docosapentaenoic 
(22:5n-3) 


0.55 


0.12 


0.49 


Docosahexaenoic (22:6n- 
3) 


2.27 


0.48 


2.02 


Others 


7.55 


1.52 


6.72 


* Fatty acids equal approximately 95% of total fat. 


Table 9. Fat Profile of Oxepa. 


% of total calories from fat 


55.2 


Polyunsaturated fatty acids 


31.44 g/L 


Monounsaturated fatty acids 


25.53 g/L 


Saturated fatty acids 


32.38 g/L 


n-6 to n-3 ratio 


1.75:1 


Cholesterol 


9.49 mg/8 fl oz 
40.1 mg/L 
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Carbohydrate: 

• The carbohydrate content is 25.0 g per 8-fl-oz serving (105.5 g/L). 

• The carbohydrate sources are 45% maltodextrin (a complex carbohydrate) 
and 55% sucrose (a simple sugar), both of which are readily digested and 

5 absorbed. 

• The high-fat and low-carbohydrate content of Oxepa is designed to 
minimize carbon dioxide (CO2) production. High CO2 levels can complicate 
weaning in ventilator-dependent patients. The low level of carbohydrate also 
may be useful for those patients who have developed stress-induced 

10 hyperglycemia. 

• Oxepa is lactose-free. 

Dietary carbohydrate, the amino acids from protein, and the glycerol 
moiety of fats can be converted to glucose within the body. Throughout this 
process, the carbohydrate requirements of glucose-dependent tissues (such as 

1 5 the central nervous system and red blood cells) are met. However, a diet free of 

carbohydrates can lead to ketosis, excessive catabolism of tissue protein, and 
loss of fluid and electrolytes. These effects can be prevented by daily ingestion 
of 50 to 100 g of digestible carbohydrate, if caloric intake is adequate. The - 
carbohydrate level in Oxepa is also sufficient to minimize gluconeogenesis, if 

20 energy needs are being met. 

Protein: 

• Oxepa contains 14.8 g of protein per 8-fl-oz serving (62.5 g/L). 

• The total calorie/nitrogen ratio (150: 1) meets the need of stressed patients. 

• Oxepa provides enough protein to promote anabolism and the maintenance 
25 of lean body mass without precipitating respiratory problems. High protein 

intakes are a concern in patients with respiratory insufficiency. Although 
protein has little effect on C0 2 production, a high protein diet will increase 
ventilatory drive. 



-131- 



WO 98/46764 



PCT/US98/07421 



• The protein sources of Oxepa are 86.8% sodium caseinate and 13.2% 
calcium caseinate. 

• As demonstrated in Table 1 1, the amino acid profile of the protein system in 
Oxepa meets or surpasses the standard for high quality protein set by 

5 theNational Academy of Sciences. 

• Oxepa is gluten-free. 



All publications and patent applications mentioned in this specification 
are indicative of the level of skill of those skilled in the art to which this 
10 invention pertains. All publications and patent applications are herein 

incorporated by reference to the same extent as if each individual publication or 
patent application was specifically and individually indicated to be incorporated 
by reference. 

The invention now being fully described, it will be apparent to one of 
1 5 ordinary skill in the art that many changes and modifications can be made 

thereto without departing from the spirit or scope of the appended claims. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION : 



(i) APPLICANT: KNUTZON, DEBORAH 
MURKER JI, PRADIP 
HUANG , YUNG-SHENG 
THURMOND, JENNIFER 
CHAUDHARY, SUNITA 
LEONARD, AMANDA 



(ii) TITLE OF INVENTION: METHODS AND COMPOSITIONS FOR SYNTHESIS 
OF LONG CHAIN POLY-UNSATURATED FATTY ACIDS IN PLANTS 

(iii) NUMBER OF SEQUENCES: 52 



(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: LIMBACH & LIMBACH L.L.P. 

(B) STREET: 2001 FERRY BUILDING 

(C) CITY: SAN FRANCISCO 

(D) STATE: CA 

(E) COUNTRY: USA 

(F) ZIP: 94111 



(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE.- Microsoft Word 



(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 



(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/834,033 

(B) FILING DATE: ll-APR-1997 



(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/833,610 

(B) FILING DATE: ll-APR-1997 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: MICHAEL R. WARD 

(B) REGISTRATION NUMBER: 38,351 

(C) REFERENCE/ DOCKET NUMBER: CGAB-320 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (415) 433-4150 

(B) TELEFAX: (415) 433-8716 

(C) TELEX: N/A 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1617 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

CGACACTCCT TCCTTCTTCT CACCCGTCCT AGTCCCCTTC AACCCCCCTC TTTGACAAAG 60 

ACAACAAACC ATGGCTGCTG CTCCCAGTGT GAGGACGTTT ACTCGGGCCG AGGTTTTGAA 120 

TGCCGAGGCT CTGAATGAGG GCAAGAAGGA TGCCGAGGCA CCCTTCTTGA TGATCATCGA 180 

CAACAAGGTG TACGATGTCC GCGAGTTCGT CCCTGATCAT CCCGGTGGAA GTGTGATTCT 240 

20 CACGCACGTT GGCAAGGACG GCACTGACGT CTTTGACACT TTTCACCCCG AGGCTGCTTG 300 

GGAGACTCTT GCCAACTTTT ACGTTGGTGA TATTGACGAG AGCGACCGCG ATATCAAGAA 360 

TGATGACTTT GCGGCCGAGG TCCGCAAGCT GCGTACCTTG TTCCAGTCTC TTGGTTACTA 420 

25 

CGATTCTTCC AAGGCATACT ACGCCTTCAA GGTCTCGTTC AACCTCTGCA TCTGGGGTTT 4 80 

GTCGACGGTC ATTGTGGCCA AGTGGGGCCA GACCTCGACC CTCGCCAACG TGCTCTCGGC 540 

30 TGCGCTTTTG GGTCTGTTCT GGCAGCAGTG CGGATGGTTG GCTCACGACT TTTTGCATCA 600 

CCAGGTCTTC CAGGACCGTT TCTGGGGTGA TCTTTTCGGC GCCTTCTTGG GAGGTGTCTG 660 

CCAGGGCTTC TCGTCCTCGT GGTGGAAGGA CAAGCACAAC ACTCACCACG CCGCCCCCAA 720 

35 

CGTCCACGGC GAGGATCCCG ACATTGACAC CCACCCTCTG TTGACCTGGA GTGAGCATGC 780 

GTTGGAGATG TTCTCGGATG TCCCAGATGA GGAGCTGACC CGCATGTGGT CGCGTTTCAT 840 

40 GGTCCTGAAC CAGACCTGGT TTTACTTCCC CATTCTCTCG TTTGCCCGTC TCTCCTGGTG 900 

CCTCCAGTCC ATTCTCTTTG TGCTGCCTAA CGGTCAGGCC CACAAGCCCT CGGGCGCGCG 960 

TGTGCCCATC TCGTTGGTCG AGCAGCTGTC GCTTGCGATG CACTGGACCT GGTACCTCGC 1020 

45 

CACCATGTTC CTGTTCATCA AGGATCCCGT CAACATGCTG GTGTACTTTT TGGTGTCGCA 1080 

GGCGGTGTGC GGAAACTTGT TGGCGATCGT GTTCTCGCTC AACCACAACG GTATGCCTGT 1140 

50 GATCTCGAAG GAGGAGGCGG TCGATATGGA TTTCTTCACG AAGCAGATCA TCACGGGTCG 12 00 

TGATGTCCAC CCGGGTCTAT TTGCCAACTG GTTCACGGGT GGATTGAACT ATCAGATCGA 12 60 

^ GCACCACTTG TTCCCTTCGA TGCCTCGCCA CAACTTTTCA AAGATCCAGC CTGCTGTCGA 1320 

GACCCTGTGC AAAAAGTACA ATGTCCGATA CCACACCACC GGTATGATCG AGGGAACTGC 1380 

AGAGGTCTTT AGCCGTCTGA ACGAGGTCTC CAAGGCTGCC TCCAAGATGG GTAAGGCGCA 14 40 

60 GTAAAAAAAA AAACAAGGAC GTTTTTTTTC GCCAGTGCCT GTGCCTGTGC CTGCTTCCCT 1500 

TGTCAAGTCG AGCGTTTCTG GAAAGGATCG TTCAGTGCAG TATCATCATT CTCCTTTTAC 1560 

-134- 



WO 98/46764 




PCT/US98/07421 



CCCCCGCTCA TATCTCATTC ATTTCTCTTA TTAAACAACT TGTTCCCCCC TTCACCG 1617 



5 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 457 amino acids 
10 (B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Ala Ala Ala Pro Ser Val Arg Thr Phe Thr Arg Ala Glu Val Leu 

15 10 15 

n Asn Ala Glu Ala Leu Asn Glu Gly Lys Lys Asp Ala Glu Ala Pro Phe 

25 20 25 30 

Leu Met lie lie Asp Asn Lys Val Tyr Asp Val Arg Glu Phe Val Pro 

35 40 45 



30 



35 



40 



45 



50 



55 



60 



Asp His Pro Gly Gly Ser Val He Leu Thr His Val Gly Lys Asp Gly 
50 55 60 

Thr Asp Val Phe Asp Thr Phe His Pro Glu Ala Ala Trp Glu Thr Leu 
65 70 75 80 

Ala Asn Phe Tyr Val Gly Asp He Asp Glu Ser Asp Arg Asp He Lys 
85 90 95 

Asn Asp Asp Phe Ala Ala Glu Val Arg Lys Leu Arg Thr Leu Phe Gin 
100 105 HO 

Ser Leu Gly Tyr Tyr Asp Ser Ser Lys Ala Tyr Tyr Ala Phe Lys Val 
115 120 125 

Ser Phe Asn Leu Cys He Trp Gly Leu Ser Thr Val He Val Ala Lys 
130 135 140 

Trp Gly Gin Thr Ser Thr Leu Ala Asn Val Leu Ser Ala Ala Leu Leu 
145 150 155 160 

Gly Leu Phe Trp Gin Gin Cys Gly Trp Leu Ala His Asp Phe Leu His 
165 170 175 

His Gin Val Phe Gin Asp Arg Phe Trp Gly Asp Leu Phe Gly Ala Phe 
180 185 190 

Leu Gly Gly Val Cys Gin Gly Phe Ser Ser Ser Trp Trp Lys Asp Lys 
195 200 205 

His Asn Thr His His Ala Ala Pro Asn Val His Gly Glu Asp Pro Asp 
210 215 220 
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lie Asp Thr His Pro Leu Leu Thr Trp Ser Glu His Ala Leu Glu Met 
225 230 235 240 

Phe Ser Asp Val Pro Asp Glu Glu Leu Thr Arg Met Trp Ser Arg Phe 
245 250 255 

Met Val Leu Asn Gin Thr Trp Phe Tyr Phe Pro He Leu Ser Phe Ala 
260 265 270 

Arg Leu Ser Trp Cys Leu Gin Ser He Leu Phe Val Leu Pro Asn Gly 
275 280 285 

Gin Ala His Lys Pro Ser Gly Ala Arg Val Pro He Ser Leu Val Glu 
290 295 300 

Gin Leu Ser Leu Ala Met His Trp Thr Trp Tyr Leu Ala Thr Met Phe 
305 310 315 - 320 

Leu Phe He Lys Asp Pro Val Asn Met Leu Val Tyr Phe Leu Val Ser 
325 330 335 

Gin Ala Val Cys Gly Asn Leu Leu Ala He Val Phe Ser Leu Asn His 
340 345 350 

Asn Gly Met Pro Val He Ser Lys Glu Glu Ala Val Asp Met Asp Phe 
355 360 365 

Phe Thr Lys Gin He He Thr Gly Arg Asp Val His Pro Gly Leu Phe 
370 375 380 

Ala Asn Trp Phe Thr Gly Gly Leu Asn Tyr Gin He Glu His His Leu 
385 390 395 400 

Phe Pro Ser Met Pro Arg His Asn Phe Ser Lys He Gin Pro Ala Val 
405 410 415 

Glu Thr Leu Cys Lys Lys Tyr Asn Val Arg Tyr His Thr Thr Gly Met 
420 425 430 

He Glu Gly Thr Ala Glu Val Phe Ser Arg Leu Asn Glu Val Ser Lys 
435 440 445 

Ala Ala Ser Lys Met Gly Lys Ala Gin 
450 455 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1488 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
GTCCCCTGTC GCTGTCGGCA CACCCCATCC TCCCTCGCTC CCTCTGCGTT TGTCCTTGGC i 
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10 



CCACCGTCTC TCCTCCACCC TCCGAGACGA CTGCAACTGT AATCAGGAAC CGACAAATAC 120 

ACGATTTCTT TTTACTCAGC ACCAACTCAA AATCCTCAAC CGCAACCCTT TTTCAGGATG 180 

GCACCTCCCA ACACTATCGA TGCCGGTTTG ACCCAGCGTC ATATCAGCAC CTCGGCCCCA 24 0 

AACTCGGCCA AGCCTGCCTT CGAGCGCAAC TACCAGCTCC CCGAGTTCAC CATCAAGGAG 300 

ATCCGAGAGT GCATCCCTGC CCACTGCTTT GAGCGCTCCG GTCTCCGTGG TCTCTGCCAC 360 

GTTGCCATCG ATCTGACTTG GGCGTCGCTC TTGTTCCTGG CTGCGACCCA GATCGACAAG 420 

TTTGAGAATC CCTTGATCCG CTATTTGGCC TGGCCTGTTT ACT GG AT CAT GCAGGGTATT 480 

15 GTCTGCACCG GTGTCTGGGT GCTGGCTCAC GAGTGTGGTC ATCAGTCCTT CTCGACCTCC 540 

AAGACCCTCA ACAACACAGT TGGTTGGATC TTGCACTCGA TGCTCTTGGT CCCCTACCAC 600 

TCCTGGAGAA TCTCGCACTC GAAGCACCAC AAGGCCACTG GCCATATGAC CAAGGACCAG 660 

20 

GTCTTTGTGC CCAAGACCCG CTCCCAGGTT GGCTTGCCTC CCAAGGAGAA CGCTGCTGCT 720 

GCCGTTCAGG AGGAGGACAT GTCCGTGCAC CTGGATGAGG AGGCTCCCAT TGTGACTTTG 780 

25 TTCTGGATGG TGATCCAGTT CTTGTTCGGA TGGCCCGCGT ACCTGATTAT GAACGCCTCT 840 

GGCCAAGACT ACGGCCGCTG GACCTCGCAC TTCCACACGT ACTCGCCCAT CTTTGAGCCC 900 

CGCAACTTTT TCGACATTAT TATCTCGGAC CTCGGTGTGT TGGCTGCCCT CGGTGCCCTG 960 

30 

ATCTATGCCT CCATGCAGTT GTCGCTCTTG ACCGTCACCA AGTACTATAT TGTCCCCTAC 1020 

CTCTTTGTCA ACTTTTGGTT GGTCCTGATC ACCTTCTTGC AGCACACCGA TCCCAAGCTG 1080 

35 CCCCATTACC GCGAGGGTGC CTGGAATTTC CAGCGTGGAG CTCTTTGCAC CGTTGACCGC 1140 

-TCGTTTGGCA AGTTCTTGGA CCATATGTTC CACGGCATTG TCCACACCCA TGTGGCCCAT 1200 

CACTTGTTCT CGCAAATGCC GTTCTACCAT GCTGAGGAAG CTACCTATCA TCTCAAGAAA 12 60 

40 

CTGCTGGGAG AGTACTATGT GTACGACCCA TCCCCGATCG TCGTTGCGGT CTGGAGGTCG 1320 

TTCCGTGAGT GCCGATTCGT GGAGGATCAG GGAGACGTGG TCTTTTTCAA GAAGTAAAAA 1380 

45 AAAAGACAAT GGACCACACA CAACCTTGTC TCTACAGACC TACGTATCAT GTAGCCATAC 14 40 

CACTTCATAA AAGAACATGA GCTCTAGAGG CGTGTCATTC GCGCCTCC 14 88 
(2) INFORMATION FOR SEQ ID NO: 4: 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 399 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 
55 (D) TOPOLOGY: linear 



60 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
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5 

10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 



Met Ala Pro Pro Asn Thr lie Asp Ala Gly Leu Thr Gin Arg His He 
15 10 15 

Ser Thr Ser Ala Pro Asn Ser Ala Lys Pro Ala Phe Glu Arg Asn Tyr 
20 25 30 

Gin Leu Pro Glu Phe Thr He Lys Glu He Arg Glu Cys He Pro Ala 
35 40 45 

His Cys Phe Glu Arg Ser Gly Leu Arg Gly Leu Cys His Val Ala He 
50 55 60 

Asp Leu Thr Trp Ala Ser Leu Leu Phe Leu Ala Ala Thr Gin He Asp 
65 70 75 80 

Lys Phe Glu Asn Pro Leu He Arg Tyr Leu Ala Trp Pro Val Tyr Trp 
85 90 95 

He Met Gin Gly He Val Cys Thr Gly Val Trp Val Leu Ala His Glu 
100 105 110 

Cys Gly His Gin Ser Phe Ser Thr Ser Lys Thr Leu Asn Asn Thr Val 
115 120 125 

Gly Trp He Leu His Ser Met Leu Leu Val Pro Tyr His Ser Trp Arg 
130 135 140 

He Ser His Ser Lys His His Lys Ala Thr Gly His Met Thr Lys Asp 
145 150 155 " 160 

Gin Val Phe Val Pro Lys Thr Arg Ser Gin Val Gly Leu Pro Pro Lys 
165 170 175 

Glu Asn Ala Ala Ala Ala Val Gin Glu Glu Asp Met Ser Val His Leu 
180 185 f 190 

Asp Glu Glu Ala Pro He Val Thr Leu Phe Trp Met Val He Gin Phe 
195 200 205 

Leu Phe Gly Trp Pro Ala Tyr Leu He Met Asn Ala Ser Gly Gin Asp 
210 215 220 

Tyr Gly Arg Trp Thr Ser His Phe His Thr Tyr Ser Pro He Phe Glu 
225 230 235 - 240 

Pro Arg-Asn Phe Phe Asp He He He Ser Asp Leu Gly Val Leu Ala 
245 250 J 255 

Ala Leu Gly Ala Leu He Tyr Ala Ser Met Gin Leu Ser Leu Leu Thr 
260 265 270 

Val Thr Lys Tyr Tyr He Val Pro Tyr Leu Phe Val Asn Phe Trp Leu 
275 280 285 

Val Leu He Thr Phe Leu Gin His Thr Asp Pro Lys Leu Pro His Tyr 
290 295 300 

Arg Glu Gly Ala Trp Asn Phe Gin Arg Gly Ala Leu Cys Thr Val Asp 



320 
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Arg Ser Phe Gly Lys Phe Leu Asp His Met Phe His Gly lie Val His 
325 330 335 

Thr His Val Ala His His Leu Phe Ser Gin Met Pro Phe Tyr His Ala 
5 340 345 350 

Glu Glu Ala Thr Tyr His Leu Lys Lys Leu Leu Gly Glu Tyr Tyr Val 
355 360 365 

10 Tyr Asp Pro Ser Pro lie Val Val Ala Val Trp Arg Ser Phe Arg Glu 

370 375 380 



15 



25 



30 



60 



Cys Arg Phe Val Glu Asp Gin Gly Asp Val Val Phe Phe Lys Lys 
385 390 395 

(2) INFORMATION FOR SEQ ID NO: 5: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1483 base pairs 
20 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 

GCTTCCTCCA GTTCATCCTC CATTTCGCCA CCTGCATTCT TTACGACCGT TAAGCAAGAT 60 

GGGAACGGAC CAAGGAAAAA CCTTCACCTG GGAAGAGCTG GCGGCCCATA ACACCAAGGA 120 

35 CGACCTACTC TTGGCCATCC GCGGCAGGGT GTACGATGTC ACAAAGTTCT TGAGCCGCCA 180 

TCCTGGTGGA GTGGACACTC TCCTGCTCGG AGCTGGCCGA GATGTTACTC CGGTCTTTGA 24 0 

GATGTATCAC GCGTTTGGGG CTGCAGATGC CATTATGAAG AAGTACTATG TCGGTACACT 300 

40 

GGTCTCGAAT GAGCTGCCCA TCTTCCCGGA GCCAACGGTG TTCCACAAAA CCATCAAGAC 3 60 

GAGAGTCGAG GGCTACTTTA CGGATCGGAA CATTGATCCC AAGAATAGAC CAGAGATCTG 420 

45 GGGACGATAC GCTCTTATCT TTGGATCCTT GATCGCTTCC TACTACGCGC AGCTCTTTGT 4 80 

GCCTTTCGTT GTCGAACGCA CATGGCTTCA GGTGGTGTTT GCAATCATCA TGGGATTTGC 54 0 

^ GTGCGCACAA GTCGGACTCA ACCCTCTTCA TGATGCGTCT CACTTTTCAG TGACCCACAA 600 

CCCCACTGTC TGGAAGATTC TGGGAGCCAC GCACGACTTT TTCAACGGAG CATCGTACCT 660 

GGTGTGGATG TACCAACATA TGCTCGGCCA TCACCCCTAC ACCAACATTG CTGGAGCAGA 720 

55 TCCCGACGTG TCGACGTCTG AGCCCGATGT TCGTCGTATC AAGCCCAACC AAAAGTGGTT 7 80 

TGTCAACCAC ATCAACCAGC ACATGTTTGT TCCTTTCCTG TACGGACTGC TGGCGTTCAA 84 0 

GGTGCGCATT CAGGACATCA ACATTTTGTA CTTTGTCAAG ACCAATGACG CTATTCGTGT 900 

CAATCCCATC TCGACATGGC ACACTGTGAT GTTCTGGGGC GGCAAGGCTT TCTTTGTCTG 960 
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40 



45 



50 



55 



60 



GTATCGCCTG 


ATTGTTCCCC 


TGCAGTATCT 


GCCCCTGGGC 


AAGGTGCTGC 


TCTTGTTCAC 


1020 


GGTCGCGGAC 


ATGGTGTCGT 


CTTACTGGCT 


GGCGCTGACC 


TTCCAGGCGA 


ACCACGTTGT 


1080 


TGAGGAAGTT 


CAGTGGCCGT 


TGCCTGACGA 


GAACGGGATC 


ATCCAAAAGG 


ACTGGGCAGC 


1140 


TATGCAGGTC 


GAGACTACGC 


AGGATTACGC 


ACACGATTCG 


CACCTCTGGA 


CCAGCATCAC 


1200 


TGGCAGCTTG 


AACTACCAGG 


CTGTGCACCA 


TCTGTTCCCC 


AACGTGTCGC 


AGCACCATTA 


1260 


TCCCGATATT 


CTGGCCATCA 


TCAAGAACAC 


CTGCAGCGAG 


TACAAGGTTC 


CATACCTTGT 


1320 


CAAGGATACG 


TTTTGGCAAG 


CATTTGCTTC 


ACATTTGGAG 


CACTTGCGTG 


TTCTTGGACT 


1380 


CCGTCCCAAG 


GAAGAGTAGA 


AGAAAAAAAG 


CGCCGAATGA 


AGTATTGCCC 


CCTTTTTCTC 


1440 


CAAGAATGGC 


AAAAGGAGAT 


CAAGTGGACA 


TTCTCTATGA 


AGA 




1483 


(2) INFORMATION FOR SEQ ID NO: 6: 











10 



15 



20 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 
25 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 



Met Gly Thr Asp Gin Gly Lys Thr Phe Thr Trp Glu Glu Leu Ala Ala 
35 1 5 10 15 



His Asn Thr Lys Asp Asp Leu Leu Leu Ala He Arg Gly Arg Val Tyr 
20 25 30 

Asp Val Thr Lys Phe Leu Ser Arg His Pro Gly Gly Val Asp Thr Leu 
35 40 45 

Leu Leu Gly Ala Gly Arg Asp Val Thr Pro Val Phe Glu Met Tyr His 
50 55 60 

Ala Phe Gly Ala Ala Asp Ala He Met Lys Lys Tyr Tyr Val Gly Thr 
65 70 75 " 80 

Leu Val Ser Asn Glu Leu Pro He Phe Pro Glu Pro Thr Val Phe His 
85 90 95 

Lys Thr He Lys Thr Arg Val Glu Gly Tyr Phe Thr Asp Arg Asn lie 
100 105 110 

Asp Pro Lys Asn Arg Pro Glu He Trp Gly Arg Tyr Ala Leu He Phe 
115 120 125 

Gly Ser Leu He Ala Ser Tyr Tyr Ala Gin Leu Phe Val Pro Phe Val 
130 135 140 

Val Glu Arg Thr Trp Leu Gin Val Val Phe Ala He He Met Gly Phe 
I 45 150 155 160 
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5 

10 
15 
20 
25 
30 
35 
40 
45 
50 
55 



Ala Cys Ala Gin Val Gly Leu Asn Pro Leu His Asp Ala Ser His Phe 
165 170 175 

Ser Val Thr His Asn Pro Thr Val Trp Lys lie Leu Gly Ala Thr His 
180 185 190 

Asp Phe Phe Asn Gly Ala Ser Tyr Leu Val Trp Met Tyr Gin His Met 
195 200 205 

Leu Gly His His Pro Tyr Thr Asn lie Ala Gly Ala Asp Pro Asp Val 
210 215 220 

Ser Thr Ser Glu Pro Asp Val Arg Arg He Lys Pro Asn Gin Lys Trp 
225 230 235 240 

Phe Val Asn His He Asn Gin His Met Phe Val Pro Phe Leu Tyr Gly 
245 250 255 

Leu Leu Ala Phe Lys Val Arg He Gin Asp He Asn He Leu Tyr Phe 
260 265 270 

Val Lys Thr Asn Asp Ala He Arg Val Asn Pro He Ser Thr Trp His 
275 280 285 

Thr Val Met Phe Trp Gly Gly Lys Ala Phe Phe Val Trp Tyr Arg Leu 
290 295 300 

He Val Pro Leu Gin Tyr Leu Pro Leu Gly Lys Val Leu Leu Leu Phe 
305 310 315 320 

Thr Val Ala Asp Met Val Ser Ser Tyr Trp Leu Ala Leu Thr Phe Gin 
325 330 335 

Ala Asn His Val Val Glu Glu Val Gin Trp Pro Leu Pro Asp Glu Asn 
340 345 350 

Gly He He Gin Lys Asp Trp Ala Ala Met Gin Val Glu Thr Thr Gin 
355 360 365 

Asp Tyr Ala His Asp Ser His Leu Trp Thr Ser He Thr Gly Ser Leu 
370 375 380 

Asn Tyr Gin Ala Val His His Leu Phe Pro Asn Val Ser Gin His His 
385 390 395 400 

Tyr Pro Asp He Leu Ala He He Lys Asn Thr Cys Ser Glu Tyr Lys 
405 410 " 415 

Val Pro Tyr Leu Val Lys Asp Thr Phe Trp Gin Ala Phe Ala Ser His 
420 425 430 

Leu Glu His Leu Arg Val Leu Gly Leu Arg Pro Lys Glu Glu 



(2) INFORMATION FOR SEQ ID NO: 7: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 355 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

Glu Val Arg Lys Leu Arg Thr Leu Phe Gin Ser Leu Gly Tyr Tyr Asp 
15 10 15 

Ser Ser Lys Ala Tyr Tyr Ala Phe Lys Val Ser Phe Asn Leu Cys lie 
20 25 30 

Trp Gly Leu Ser Thr Val lie Val Ala Lys Trp Gly Gin Thr Ser Thr 
35 40 45 

Leu Ala Asn Val Leu Ser Ala Ala Leu Leu Gly Leu Phe Trp Gin Gin 
50 55 60 

Cys Gly Trp Leu Ala His Asp Phe Leu His His Gin Val Phe Gin Asp 
65 70 75 80 

Arg Phe Trp Gly Asp Leu Phe Gly Ala Phe Leu Gly Gly Val Cys Gin 
85 90 95 

Gly Phe Ser Ser Ser Trp Trp Lys Asp Lys His Asn Thr His His Ala 
100 105 " 110 

Ala Pro Asn Val His Gly Glu Asp Pro Asp lie Asp Thr His Pro Leu 
115 120 * 125 

Leu Thr Trp Ser Glu His Ala Leu Glu Met Phe Ser Asp Val Pro Asp 
130 135 140 

Glu Glu Leu Thr Arg Met Trp Ser Arg Phe Met Val Leu Asn Gin Thr 
145 150 155 160 

Trp Phe Tyr Phe Pro lie Leu Ser Phe Ala Arg Leu Ser Trp Cys Leu 
165 170 175 

Gin Ser lie Leu Phe Val Leu Pro Asn Gly Gin Ala His Lys Pro Ser 
180 185 190 

Gly -Ala Arg Val Pro He Ser Leu Val Glu Gin Leu Ser Leu Ala Met 
195 200 205 

His Trp Thr Trp Tyr Leu Ala Thr Met Phe Leu Phe He Lys Asp Pro 
210 215 220 

Val Asn Met Leu Val Tyr Phe Leu Val Ser Gin Ala Val Cys Gly Asn 
225 230 235 240 

Leu Leu Ala He Val Phe Ser Leu Asn His Asn Gly Met Pro Val He 
245 250 255 

Ser Lys Glu Glu Ala Val Asp Met Asp Phe Phe Thr Lys Gin He He 
260 265 270 

Thr Gly Arg Asp Val His Pro Gly Leu Phe Ala Asn Trp Phe Thr Gly 
275 280 285 
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Gly Leu Asn Tyr Gin lie Glu His His Leu Phe Pro Ser Met Pro Arg 
290 295 300 

His Asn Phe Ser Lys lie Gin Pro Ala Val Glu Thr Leu Cys Lys Lys 
5 305 310 315 320 

Tyr Asn Val Arg Tyr His Thr Thr Gly Met He Glu Gly Thr Ala Glu 
325 330 335 

10 Val Phe Ser Arg Leu Asn Glu Val Ser Lys Ala Ala Ser Lys Met Gly 

340 345 350 

Lys Ala Gin 
355 

15 

(2) INFORMATION FOR SEQ ID NO: 8: 

SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

( D ) TOPOLOGY : linear 

MOLECULE TYPE: peptide 



20 



25 



(i) 



(ii) 



30 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Val Thr Leu Tyr Thr Leu Ala Phe Val Ala Ala Asn Ser Leu Gly Val 
15 10 15 



Leu Tyr Gly Val Leu Ala Cys Pro Ser Val Xaa Pro His Gin He Ala 

35 20 25 30 

Ala Gly Leu Leu Gly Leu Leu Trp He Gin Ser Ala Tyr He Gly Xaa 

35 40 45 

40 Asp Ser Gly His Tyr Val He Met Ser Asn Lys Ser Asn Asn Xaa Phe 

50 55 60 



Ala Gin Leu Leu Ser Gly Asn Cys Leu Thr Gly He He Ala Trp Trp 

65 70 75 80 

Lys Trp Thr His Asn Ala His His Leu Ala Cys Asn Ser Leu Asp Tyr 
85 90 95 



Gly Pro Asn Leu Gin His He Pro 
50 100 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 252 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

60 (ii) MOLECULE TYPE: peptide 



-143- 



WO 98/46764 




PCT/US98/07421 



5 

10 
15 
20 
25 
30 
35 
40 
45 
50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Gly Val Leu Tyr Gly Val Leu Ala Cys Thr Ser Val Phe Ala His Gin 
15 10 15 

lie Ala Ala Ala Leu Leu Gly Leu Leu Trp lie Gin Ser Ala Tyr lie 
20 25 30 

Gly His Asp Ser Gly His Tyr Val lie Met Ser Asn Lys Ser Tyr Asn 
35 40 45 

Arg Phe Ala Gin Leu Leu Ser Gly Asn Cys Leu Thr Gly lie Ser He 
50 55 60 

Ala Trp Trp Lys Trp Thr His Asn Ala His His Leu Ala Cys Asn Ser 
65 70 75 80 

Leu Asp Tyr Asp Pro Asp Leu Gin His He Pro Val Phe Ala Val Ser 
85 90 95 

Thr Lys Phe Phe Ser Ser Leu Thr Ser Arg Phe Tyr Asp Arg Lys Leu 
100 105 110 

Thr Phe Gly Pro Val Ala Arg Phe Leu Val Ser Tyr Gin His Phe Thr 
115 120 125 

Tyr Tyr Pro Val Asn Cys Phe Gly Arg He Asn Leu Phe He Gin Thr 
130 135 140 

Phe Leu Leu Leu Phe Ser Lys Arg Glu Val Pro Asp Arg Ala Leu Asn 
145 150 155 160 

Phe Ala Gly He Leu Val Phe Trp Thr Trp Phe Pro Leu Leu Val Ser 
165 170 175 

Cys Leu Pro Asn Trp Pro Glu Arg Phe Phe Phe Val Phe Thr Ser Phe 
180 185 190 

Thr Val Thr Ala Leu Gin His He Gin Phe Thr Leu Asn His Phe Ala 
195 200 205 

Ala Asp Val Tyr Val Gly Pro Pro Thr Gly Ser Asp Trp Phe Glu Lys 
210 215 220 

Gin Ala Ala Gly Thr lie Asp He Ser Cys Arg Ser Tyr Met Asp Trp 
225 230 235 240 

Phe Phe Gly Gly Leu Gin Phe Gin Leu Glu His His 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 125 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



245 



250 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Gly Xaa Xaa Asn Phe Ala Gly lie Leu Val Phe Trp Thr Trp Phe Pro 
1 5 10 15 

Leu L «u Val Ser Cys Leu Pro Asn Trp Pro Glu Arg Phe Xaa Phe Val 
10 20 25 30 

Phe Thr Gly Phe Thr Val Thr Ala Leu Gin His He Gin Phe Thr Leu 
35 40 45 

15 Asn His Phe Ala Ala Asp Val Tyr Val Gly Pro Pro Thr Gly Ser Asp 

50 55 60 

Trp Phe Glu Lys Gin Ala Ala Gly Thr He Asp He Ser Cys Arg Ser 
65 70 75 80 

Tyr Met Asp Trp Phe Phe Cys Gly Leu Gin Phe Gin Leu Glu His His 
85 90 95 

Leu Phe Pro Arg Leu Pro Arg Cys His Leu Arg Lys Val Ser Pro Val 
25 100 105 ~ 110 

Gly Gin Arg Gly Phe Gin Arg Lys Xaa Asn Leu Ser Xaa 
115 120 125 

30 (2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 131 amino acids 

(B) TYPE: amino acid 

35 (C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 



20 



40 



45 



50 



55 



60 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Pro Ala Thr Glu Val Gly Gly Leu Ala Trp Met He Thr Phe Tyr Val 
1 5 10 15 

Arg Phe Phe Leu Thr Tyr Val Pro Leu Leu Gly Leu Lys Ala Phe Leu 
20 25 30 

Gly Leu Phe Phe He Val Arg Phe Leu Glu Ser Asn Trp Phe Val Trp 
35 40 45 

Val Thr Gin Met Asn His He Pro Met His He Asp His Asp Arg Asn 
50 55 60 

Met Asp Trp Val Ser Thr Gin Leu Gin Ala Thr Cys Asn Val His Lys 
65 70 75 80 

Ser Ala Phe Asn Asp Trp Phe Ser Gly His Leu Asn Phe Gin He Glu 
85 90 95 
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His His Leu Phe Pro Thr Met Pro Arg His Asn Tyr His Xaa Val Ala 
100 105 110 

Pro Leu Val Gin Ser Leu Cys Ala Lys His Gly lie Glu Tyr Gin Ser 
115 120 125 

Lys Pro Leu 
130 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 87 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Cys Ser Pro Lys Ser Ser Pro Thr Arg Asn Met Thr Pro Ser Pro Phe 
1 5 10 15 

lie Asp Trp Leu Trp Gly Gly Leu Asn Tyr Gin lie Glu His His Leu 
20 25 30 

Phe Pro Thr Met Pro Arg Cys Asn Leu Asn Arg Cys Met Lys Tyr Val 
35 40 " 45 

Lys Glu Trp Cys Ala Glu Asn Asn Leu Pro Tyr Leu Val Asp Asp Tyr 
50 55 60 

Phe Val Gly Tyr Asn Leu Asn Leu Gin Gin Leu Lys Asn Met Ala Glu 
65 70 75 80 

Leu Val Gin Ala Lys Ala Ala 
85 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 143 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Arg His Glu Ala Ala Arg Gly Gly Thr Arg Leu Ala Tyr Met Leu Val 
15 10 15 

Cys Met Gin Trp Thr Asp Leu Leu Trp Ala Ala Ser Phe Tyr Ser Arg 
20 25 30 
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r 



10 



Phe Phe Leu Ser Tyr Ser Pro Phe Tyr Gly Ala Thr Gly Thr Leu Leu 
35 40 45 

Leu Phe Val Ala Val Arg Val Leu Glu Ser His Trp Phe Val Trp He 
50 55 60 

Thr Gin Met Asn His He Pro Lys Glu He Gly His Glu Lys His Arq 
65 70 75 80 

Asp Trp Ala Ser Ser Gin Leu Ala Ala Thr Cys Asn Val Glu Pro Ser 
85 90 95 

f c Leu Phe Ile As P Tr P Phe Ser G1 Y His Leu Asn Phe Gin He Glu His 

° 100 105 no 

His Leu Phe Pro Thr Met Thr Arg His Asn Tyr Arg Xaa Val Ala Pro 
115 120 125 

Leu Val Lys Ala Phe Cys Ala Lys His Gly Leu His Tyr Glu Val 
130 135 140 

(2) INFORMATION FOR SEQ ID NO: 14: 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 



20 



30 



35 



40 



50 



55 



60 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 



Leu His His Thr Tyr Thr Asn Ile Ala Gly Ala Asp Pro Asp Val Ser 
15 10 15 

Thr Ser Glu Pro Asp Val Arg Arg He Lys Pro Asn Gin Lys Trp Phe 
20 25 30 

A c Val Asn His Ile Asn Gin His Met Phe Val Pro Phe Leu Tyr Gly Leu 

HD 35 40 45 _ 

Leu Ala^Phe Lys Val Arg He Gin Asp Ile Asn Ile Leu Tyr Phe Val 
50 55 6 o 

Lys Thr Asn Asp Ala Ile Arg Val Asn Pro Ile Ser Thr Trp His Thr 
65 7 0 75 80 

Val Met Phe Trp Gly Gly Lys Ala Phe Phe Val Trp Tyr Arg Leu Ile 
85 90 95 

Val Pro Leu Gin Tyr Leu Pro Leu Gly Lys Val Leu Leu Leu Phe Thr 
100 105 no 

Val Ala Asp Met Val Ser Ser Tyr Trp Leu Ala Leu Thr Phe Gin Ala 
115 120 125 



-147- 



WO 98/46764 W W PCT/US98/07421 



Asn Tyr Val Val Glu Glu Val Gin Trp Pro Leu Pro Asp Glu Asn Gly 
130 135 140 

lie lie Gin Lys Asp Trp Ala Ala Met Gin Val Glu Thr Thr Gin Asp 
5 145 150 155 160 

Tyr Ala His Asp Ser His Leu Trp Thr Ser lie Thr Gly Ser Leu Asn 
165 170 175 

10 Tyr Gin Xaa Val His His Leu Phe Pro His 

180 185 

(2) INFORMATION FOR SEQ ID NO: 15: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 



20 



25 



30 



40 



45 



60 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

His Xaa Xaa His His 
1 5 

(2) INFORMATION FOR SEQ ID NO: 16: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 6 amino acids 
35 (B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 



Met Ala Ala Gin lie Lys Lys Tyr lie Thr Ser Asp Glu Leu Lys Asn 
15 10 15 



His Asp Lys Pro Gly Asp Leu Trp lie Ser lie Gin Gly Lys Ala Tyr 
50 20 25 30 

Asp Val Ser Asp Trp Val Lys Asp His Pro Gly Gly Ser Phe Pro Leu 
35 40 45 

55 Lys Ser Leu Ala Gly Gin Glu Val Thr Asp Ala Phe Val Ala Phe His 

50 55 60 



Pro Ala Ser Thr Trp Lys Asn Leu Asp Lys Phe Phe Thr Gly Tyr Tyr 
65 70 75 80 

Leu Lys Asp Tyr Ser Val Ser Glu Val Ser Lys Val Tyr Arg Lys Leu 
85 90 95 
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Val Phe Glu Phe Ser Lys Met Gly Leu Tyr Asp Lys Lys Gly His He 
100 105 * HO 

Met Phe Ala Thr Leu Cys Phe He Ala Met Leu Phe Ala Met Ser Val 
115 120 125 

Tyr Gly Val Leu Phe Cys Glu Gly Val Leu Val His Leu Phe Ser Gly 
130 135 140 

Cys Leu Met Gly Phe Leu Trp He Gin Ser Gly Trp He Gly His Asp 
145 150 155 * 160 



Ala Gly His Tyr Met Val Val Ser Asp Ser Arg Leu Asn Lys Phe Met 
15 165 170 175 



Gly He Phe Ala Ala Asn Cys Leu Ser Gly He Ser He Gly Trp Trp 
180 185 190 

Lys Trp Asn His Asn Ala His His He Ala Cys Asn Ser Leu Glu Tyr 
195 200 205 

Asp Pro Asp Leu Gin Tyr He Pro Phe Leu Val Val Ser Ser Lys Phe 
210 215 220 

Phe Gly Ser Leu Thr Ser His Phe Tyr Glu Lys Arg Leu Thr Phe Asp 
225 230 235 240 



Ser Leu Ser Arg Phe Phe Val Ser Tyr Gin His Trp Thr Phe Tyr Pro 
30 245 250 255 



He Met Cys Ala Ala Arg Leu Asn Met Tyr Val Gin Ser Leu He Met 
260 265 270 

Leu Leu Thr Lys Arg Asn Val Ser Tyr Arg Ala Gin Glu Leu Leu Gly 
275 280 285 

Cys Leu Val Phe Ser He Trp Tyr Pro Leu Leu Val Ser Cys Leu Pro 
290 295 300 

Asn Trp Gly Glu Arg He Met Phe Val He Ala Ser Leu Ser Val Thr 
305 310 315 320 

Gly Met Gin Gin Val Gin Phe Ser Leu Asn His Phe Ser Ser Ser Val 
325 330 335 

Tyr Val Gly Lys Pro Lys Gly Asn Asn Trp Phe Glu Lys Gin Thr Asp 
340 345 350 

Gly Thr Leu Asp He Ser Cys Pro Pro Trp Met Asp Trp Phe His Gly 
355 360 365 

Gly Leu Gin Phe Gin He Glu His His Leu Phe Pro Lys Met Pro Arg 
370 375 380 

Cys Asn Leu Arg Lys He Ser Pro Tyr Val He Glu Leu Cys Lys Lys 
385 390 395 400 

His Asn Leu Pro Tyr Asn Tyr Ala Ser Phe Ser Lys Ala Asn Glu Met 
405 4io 415 
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Thr Leu Arg Thr Leu Arg Asn Thr Ala Leu Gin Ala Arg Asp lie Thr 
420 425 430 

Lys Pro Leu Pro Lys Asn Leu Val Trp Glu Ala Leu His Thr 
5 435 440 445 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 359 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

15 (ii) MOLECULE TYPE: peptide 



20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Met Leu Thr Ala Glu Arg lie Lys Phe Thr Gin Lys Arg Gly Phe Arg 

1 5 10 15 

25 Arg Val Leu Asn Gin Arg Val Asp Ala Tyr Phe Ala Glu His Gly Leu 

20 25 30 

Thr Gin Arg Asp Asn Pro Ser Met Tyr Leu Lys Thr Leu lie lie Val 
35 40 45 

Leu Trp Leu Phe Ser Ala Trp Ala Phe Val Leu Phe Ala Pro Val He 
50 55 60 



30 



Phe Pro Val Arg Leu Leu Gly Cys Met Val Leu Ala He Ala Leu Ala 
35 65 70 75 80 

Ala Phe Ser Phe Asn Val Gly His Asp Ala Asn His Asn Ala Tyr Ser 
85 90 95 

40 Ser Asn Pro His He Asn Arg Val Leu Gly Met Thr Tyr Asp Phe Val 

100 105 110 



45 



55 



60 



Gly Leu Ser Ser Phe Leu Trp Arg Tyr Arg His Asn Tyr Leu His His 
115 120 125 

Thr Tyr Thr Asn He Leu Gly His Asp Val Glu He His Gly Asp Gly 
130 - 135 140 



Ala Val Arg Met Ser Pro Glu Gin Glu His Val Gly He Tyr Arg Phe 
50 145 150 155 160 



Gin Gin Phe Tyr He Trp Gly Leu Tyr Leu Phe He Pro Phe Tyr Trp 

165 170 175 

Phe Leu Tyr Asp Val Tyr Leu Val Leu Asn Lys Gly Lys Tyr His Asp 

180 185 " 190 

His Lys He Pro Pro Phe Gin Pro Leu Glu Leu Ala Ser Leu Leu Gly 

195 200 205 

He Lys Leu Leu Trp Leu Gly Tyr Val Phe Gly Leu Pro Leu Ala Leu 

210 215 220 
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Gly Phe Ser lie Pro Glu Val Leu He Gly Ala Ser Val Thr Tyr Met 
225 230 235 240 

Thr Tyr Gly He Val Val Cys Thr He Phe Met Leu Ala His Val Leu 
245 250 255 

Glu Ser Thr Glu Phe Leu Thr Pro Asp Gly Glu Ser Gly Ala He Asp 
2 60 265 270 

Asp Glu Trp Ala He Cys Gin He Arg Thr Thr Ala Asn Phe Ala Thr 
275 280 285 



Asn Asn Pro Phe Trp Asn Trp Phe Cys Gly Gly Leu Asn His Gin Val 
15 290 295 300 

Thr His His Leu Phe Pro Asn He Cys His He His Tyr Pro Gin Leu 
305 310 * 315 "* 320 

20 Glu Asn He He Lys Asp Val Cys Gin Glu Phe Gly Val Glu Tyr Lys 

325 330 " 335 



Val Tyr Pro Thr Phe Lys Ala Ala He Ala Ser Asn Tyr Arg Trp Leu 
340 345 350 

Glu Ala Met Gly Lys Ala Ser 
355 

(2) INFORMATION FOR SEQ ID NO: 18: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 365 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 
35 (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 



M et Thr Ser Thr Thr Ser Lys Val Thr Phe Gly Lys Ser He Gly Phe 
45 1 5 10 15 



Arg Lys Glu Leu Asn Arg Arg Val Asn Ala Tyr Leu Glu Ala Glu Asn 
20 25 30 

He Ser Pro Arg Asp Asn Pro Pro Met Tyr Leu Lys Thr Ala He He 
35 40 45 

Leu Ala Trp Val Val Ser Ala Trp Thr Phe Val Val Phe Gly Pro Asp 
50 55 60 

Val Leu Trp Met Lys Leu Leu Gly Cys He Val Leu Gly Phe Gly Val 
65 70 75 " 80 

Ser Ala Val Gly Phe Asn He Ser His Asp Gly Asn His Gly Gly Tyr 
85 90 95 
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Ser Lys Tyr Gin Trp Val Asn Tyr Leu Ser Gly Leu Thr His Asp Ala 
100 105 110 

lie Gly Val Ser Ser Tyr Leu Trp Lys Phe Arg His Asn Val Leu His 
5 115 120 125 

His Thr Tyr Thr Asn He Leu Gly His Asp Val Glu He His Gly Asp 
130 135 140 

10 Glu Leu Val Arg Met Ser Pro Ser Met Glu Tyr Arg Trp Tyr His Arg 

145 150 155 160 



15 



30 



45 



Tyr Gin His Trp Phe He Trp Phe Val Tyr Pro Phe He Pro Tyr Tyr 
165 170 175 

Trp Ser He Ala Asp Val Gin Thr Met Leu Phe Lys Arg Gin Tyr His 
180 185 190 



Asp His Glu He Pro Ser Pro Thr Trp Val Asp He Ala Thr Leu Leu 
20 195 200 205 

Ala Phe Lys Ala Phe Gly Val Ala Val Phe Leu He He Pro He Ala 
210 215 220 

25 Val Gly Tyr Ser Pro Leu Glu Ala Val He Gly Ala Ser He Val Tyr 

225 230 235 240 



Met Thr His Gly Leu Val Ala Cys Val Val Phe Met Leu Ala His Val 
245 250 255 

lie Glu Pro Ala Glu Phe Leu Asp Pro Asp Asn Leu His He Asp Asp 
260 265 270 



Glu Trp Ala He Ala Gin Val Lys Thr Thr Val Asp Phe Ala Pro Asn 
35 275 280 285 

Asn Thr He He Asn Trp Tyr Val Gly Gly Leu Asn Tyr Gin Thr Val 
290 295 300 

40 His His Leu Phe Pro His He Cys His He His Tyr Pro Lys He Ala 

305 310 315 ^ 320 



Pro He Leu Ala Glu Val Cys Glu Glu Phe Gly Val Asn Tyr Ala Val 
325 330 " 335 

His Gin Thr Phe Phe Gly Ala Leu Ala Ala Asn Tyr Ser Trp Leu Lys 
340 345 350 



Lys Met Ser He Asn Pro Glu Thr Lys Ala He Glu Gin 
50 355 360 365 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

60 (ii) MOLECULE TYPE: other nucleic acid 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

5 CCAAGCTTCT GCAGGAGCTC TTTTTTTTTT ttttt 35 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

15 (ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc - "Synthetic oligonucleotide" 

(ix) FEATURE: 
20 (A) NAME /KEY : misc_f eature 

(B) LOCATION: 21 

(D) OTHER INFORMATION: /number= 1 
/note= "N=Inosine or Cytosine" 

25 (ix) FEATURE: 

(A) NAME /KEY: misc_f eature 

(B) LOCATION: 27 

(D) OTHER INFORMATION : /number= 2 
/note= "N=Inosine or Cytosine" 



30 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 
CUACUACUAC UACAYCAYAC NTAYACNAAY AT 32 
(2) INFORMATION FOR SEQ ID NO: 21: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 27 base pairs 
40 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
45 (A) DESCRIPTION: /desc = "Synthetic oligonucleotide" 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 
50 (B) LOCATION: 13 

(D) OTHER INFORMATION: /nuraber= 1 
/note= "N=Inosine or Cytosine" 

(ix) FEATURE: 
55 (A) NAME /KEY : misc_f eature 

(B) LOCATION: 19 

(D) OTHER INFORMATION: /number= 2 
/note= "N=Inosine or Cytosine" 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
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10 



CAUCAUCAUC AUNGGRAANA RRTGRTG 27 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 
CUACUACUAC UAGGAGTCCT CTACGGTGTT TTG 33 
20 (2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 
25 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



30 



(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 

35 CAUCAUCAUC AUATGATGCT CAAGCTGAAA CTG 33 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

45 (ii) MOLECULE TYPE: peptide 



50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 

Gin Xaa Xaa His His 
1 5 

55 (2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 
60 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
CUACUACUAC UACTCGAGCA AGATGGGAAC GGACCAAGG 39" 
10 (2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 
15 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



20 



55 



60 



(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

25 CAUCAUCAUC AUCTCGAGCT ACTCTTCCTT GGGACGGAG 39 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 47 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

35 (ii) MOLECULE TYPE: other nucleic acid 



40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

CUACUACUAC UATCTAGACT CGAGACCATG GCTGCTGCTC CAGTGTG 47 
(2) INFORMATION FOR SEQ ID NO: 28: 

45 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
50 (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
CAUCAUCAUC AUAGGCCTCG AGTTACTGCG CCTTACCCAT 4 0 
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15 



40 



45 



50 



55 



60 



(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

CUACUACUA CUAGGATCCA TGGCACCTCC CAACACT 37 

(2) INFORMATION FOR SEQ ID NO: 30: 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: other nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:30: 

30 CAUCAUCAU CAUGGTACCT CGAGTTACTT CTTGAAAAAG AC 42 

(2) INFORMATION FOR SEQ ID NO: 31: 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1219 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid (Edited Contig 2 692004) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 



GCACGCCGAC 


CGGCGCCGGG 


AGATCCTGGC 


AAAGTATCCA 


GAGATAAAGT 


CCTTGATGAA 


60 


ACCTGATCCC 


AATTTGATAT 


GGATTATAAT 


TATGATGGTT 


CTCACCCAGT 


TGGGTGCATT 


120 


TTACATAGTA 


AAAGACTTGG 


ACTGGAAATG 


GGTCATATTT 


GGGGCCTATG 


CGTTTGGCAG 


180 


TTGCATTAAC 


CACTCAATGA 


CTCTGGCTAT 


TCATGAGATT 


GCCCACAATG 


CTGCCTTTGG 


240 


CAACTGCAAA 


GCAATGTGGA 


ATCGCTGGTT 


TGGAATGTTT 


GCTAATCTTC 


CTATTGGGAT 


300 


TCCATATTCA 


ATTTCCTTTA 


AGAGGTATCA 


CATGGATCAT 


CATCGGTACC 


TTGGAGCTGA 


360 


TGGCGTCGAT 


GTAGATATTC 


CTACCGATTT 


TGAGGGCTGG 


TTCTTCTGTA 


CCGCTTTCAG 


420 


AAAGTTTATA 


TGGGTTATTC 


TTCAGCCTCT 


CTTTTATGCC 


TTTCGACCTC 


TGTTCATCAA 


480 
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15 



20 



25 



30 



CCCCAAACCA 


ATTACGTATC 


TGGAAGTTAT 


CAATACCGTG 


GCACAGGTCA 


CTTTTGACAT 


540 


TTTAATTTAT 


TACTTTTTGG 


GAATTAAATC 


CTTAGTCTAC 


ATGTTGGCAG 


CATCTTTACT 


600 


TGGCCTGGGT 


TTGCACCCAA 


TTTCTGGACA 


TTTTATAGCT 


GAGCATTACA 


TGTTCTTAAA 


660 


GGGTCATGAA 


ACTTACTCAT 


ATTATGGGCC 


TCTGAATTTA 


CTTACCTTCA 


ATGTGGGTTA 


720 


TCATAATGAA 


CATCATGATT 


TCCCCAACAT 


TCCTGGAAAA 


AGTCTTCCAC 


TGGTGAGGAA 


780 


AATAGCAGCT 


GAATACTATG 


ACAACCTCCC 


TCACTACAAT 


TCCTGGATAA 


AAGTACTGTA 


840 


TGATTTTGTG 


AT G GAT GAT A 


CAATAAGTCC 


CTACTCAAGA 


ATGAAGAGGC 


ACCAAAAAGG 


900 


AG AC? AT Tfi 




Inl un J. ± r\<j J. 


o \^ AAA^jtjlj A 


11L11L1 UL. A 


AAAC1 T I ACaA 


you 


TGATAAAATG 


GAATTTTTGC 


ATTATTAAAC 


TTGAGACCAG 


TGATGCTCAG 


AAGCTCCCCT 


1020 


GGCACAATTT 


CAGAGTAAGA 


GCTCGGTGAT 


ACCAAGAAGT 


GAATCTGGCT 


TTTAAACAGT 


1080 


CAGCCTGACT 


CTGTACTGCT 


CAGTTTCACT 


CACAGGAAAC 


TTGTGACTTG 


TGTATTATCG 


1140 


TCATTGAGGA 


TGTTTCACTC 


ATGTCTGTCA 


TTTTATAAGC 


ATATCATTTA 


AAAAGCTTCT 


1200 


AAAAAGCTAT 


TTCGCCAGG 










1219 



(2) INFORMATION FOR SEQ ID NO: 32: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 655 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 

35 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid (Edited Contig 2153526) 
40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 



TTACCTTCTA CGTCCGCTTC TTCCTCACTT ATGTGCCACT ATTGGGGCTG AAAGCTTCCT 60 

45 

GGGCCTTTTC TTCATAGTCA GGTTCCTGGA AAGCAACTGG TTTGTGTGGG TGACACAGAT 120 

GAACCATATT CCCATGCACA TTGATCATGA CCGGAACATG GACTGGGTTT CCACCCAGCT 180 

50 CCAGGCCACA TGCAATGTCC ACAAGTCTGC CTTCAATGAC TGGTTCAGTG GACACCTCAA 240 

CTTCCAGATT GAGCACCATC TTTTTCCCAC GATGCCTCGA CACAATTACC ACAAAGTGGC 300 

^ TCCCCTGGTG CAGTCCTTGT GTGCCAAGCA TGGCATAGAG TACCAGTCCA AGCCCCTGCT 360 

GTCAGCCTTC GCCGACATCA TCCACTCACT AAAGGAGTCA GGGCAGCTCT GGCTAGATGC 420 

CTATCTTCAC CAATAACAAC AGCCACCCTG CCCAGTCTGG AAGAAGAGGA GGAAGACTCT 4 80 

60 GGAGCCAAGG CAGAGGGGAG CTTGAGGGAC AATGCCACTA TAGTTTAATA CTCAGAGGGG 540 

GTTGGGTTTG GGGACATAAA GCCTCTGACT CAAACTCCTC CCTTTTATCT TCTAGCCACA 600 
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GTTCTAAGAC CCAAAGTGGG GGGTGGACAC AGAAGTCCCT AGGAGGGAAG GAGCT 655 

5 (2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 304 base pairs 

(B) TYPE: nucleic acid 
10 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid (Edited Contig 3506132) 
15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

GTCTTTTACT TTGGCAATGG CTGGATTCCT ACCCTCATCA CGGCCTTTGT CCTTGCTACC 60 

TCTCAGGCCC AAGCTGGATG GCTGCAACAT GATTATGGCC ACCTGTCTGT CTACAGAAAA 120 

20 

CCCAAGTGGA ACCACCTTGT CCACAAATTC GTCATTGGCC ACTTAAAGGG TGCCTCTGCC 180 

AACTGGTGGA ATCATCGCCA CTTCCAGCAC CACGCCAAGC CTAACATCTT CCACAAGGAT 24 0 

25 CCCGATGTGA ACATGCTGCA CGTGTTTGTT CTGGGCGAAT GGCAGCCCAT CGAGTACGGC 300 

AAGA 304 

30 (2) INFORMATION FOR SEQ ID NO: 34: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 918 base pairs 

(B) TYPE: nucleic acid 
35 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid (Edited Contig 3854933) 
40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

CAGGGACCTA CCCCGCGCTA CTTCACCTGG GACGAGGTGG CCCAGCGCTC AGGGTGCGAG 60 

GAGCGGTGGC TAGTGATCGA CCGTAAGGTG TACAACATCA GCGAGTTCAC CCGCCGGCAT 120 

45 

CCAGGGGGCT CCCGGGTCAT CAGCCACTAC GCCGGGCAGG ATGCCACGGA TCCCTTTGTG 180 

GCCTTCCACA TCAACAAGGG CCTTGTGAAG AAGTATATGA ACTCTCTCCT GATTGGAGAA 2 40 

50 CTGTCTCCAG AGCAGCCCAG CTTTGAGCCC ACCAAGAATA AAGAGCTGAC AGATGAGTTC 300 

CGGGAGCTGC GGGCCACAGT GGAGCGGATG GGGCTCATGA AGGCCAACCA TGTCTTCTTC 360 

^ CTGCTGTACC TGCTGCACAT CTTGCTGCTG GATGGTGCAG CCTGGCTCAC CCTTTGGGTC 420 

TTTGGGACGT CCTTTTTGCC CTTCCTCCTC TGTGCGGTGC TGCTCAGTGC AGTTCAGGCC 4 80 

CAGGCTGGCT GGCTGCAGCA TGACTTTGGG CACCTGTCGG TCTTCAGCAC CTCAAAGTGG 540 

60 AACCATCTGC TACATCATTT TGTGATTGGC CACCTGAAGG GGGCCCCCGC CAGTTGGTGG 600 

AACCAC AT GC ACTTCCAGCA CCATGCCAAG CCCAACTGCT TCCGCAAAGA CCCAGACATC 660 
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AACATGCATC CCTTCTTCTT TGCCTTGGGG AAGATCCTCT CTGTGGAGCT TGGGAAACAG 720 

AAGAAAAAAT ATATGCCGTA CAACCACCAG CACARATACT TCTTCCTAAT TGGGCCCCCA 7 80 

GCCTTGCTGC CTCTCTACTT CCAGTGGTAT ATTTTCTATT TTGTTATCCA GCGAAAGAAG 840 

TGGGTGGACT TGGCCTGGAT CAGCAAACAG GAATACGATG AAGCCGGGCT TCCATTGTCC 900 

10 ACCGCAAATG CTTCTAAA 918 

(2) INFORMATION FOR SEQ ID NO: 35: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1686 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

20 

<ii) MOLECULE TYPE: other nucleic acid (Edited Contig 2511785) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 



25 





GCCACTTAAA 


GGGTGCCTCT 


GCCAACTGGT 


GGAATCATCG 


CCACTTCCAG 


CACCACGCCA 


60 




AGCCTAACAT 


CTTCCACAAG 


GATCCCGATG 


TGAACATGCT 


GCACGTGTTT 


GTTCTGGGCG 


120 


30 


AATGGCAGCC 


CATCGAGTAC 


GGCAAGAAGA 


AGCTGAAATA 


CCTGCCCTAC 


AATCACCAGC 


180 




ACGAATACTT 


CTTCCTGATT 


GGGCCGCCGC 


TGCTCATCCC 


CATGTATTTC 


CAGTACCAGA 


240 


35 


TCATCATGAC 


CATGATCGTC 


CATAAGAACT 


GGGTGGACCT 


GGCCTGGGCC 


GTCAGCTACT 


300 


ACATCCGGTT 


CTTCATCACC 


TACATCCCTT 


TCTACGGCAT 


CCTGGGAGCC 


CTCCTTTTCC 


360 




TCAACTTCAT 


CAGGTTCCTG 


GAGAGCCACT 


GGTTTGTGTG 


GGTCACACAG 


ATGAATCACA 


420 


40 


TCGTCATGGA 


GATTGACCAG 


GAGGCCTACC 


GTGACTGGTT 


CAGTAGCCAG 


CTGACAGCCA 


480 




CCTGCAACGT 


GGAGCAGTCC 


TTCTTCAACG 


ACTGGTTCAG 


TGGACACCTT 


AACTTCCAGA 


540 


45 


TTGAGCACCA 


CCTCTTCCCC 


ACCATGCCCC 


GGCACAACTT 


ACACAAGATC 


GCCCCGCTGG 


600 


TGAAGTCTCT 


ATGTGCCAAG 


CATGGCATTG 


AATACCAGGA 


GAAGCCGCTA 


CTGAGGGCCC 


660 




TGCTGGACAT 


CATCAGGTCC 


CTGAAGAAGT 


CTGGGAAGCT 


GTGGCTGGAC 


GCCTACCTTC 


720 


50 


ACAAATGAAG 


CCACAGCCCC 


CGGGACACCG 


TGGGGAAGGG 


GTGCAGGTGG 


GGTGATGGCC 


780 




AGAGGAATGA 


TGGGCTTTTG 


TTCTGAGGGG 


TGTCCGAGAG 


GCTGGTGTAT 


GCACTGCTCA 


840 


55 


CGGACCCCAT 


GTTGGATCTT 


TCTCCCTTTC 


TCCTCTCCTT 


TTTCTCTTCA 


CAfCTCCCCC 


900 


ATAGCACCCT 


GCCCTCATGG 


GACCTGCCCT 


CCCTCAGCCG 


TCAGCCATCA 


GCCATGGCCC 


960 




TCCCAGTGCC 


TCCTAGCCCC 


TTCTTCCAAG 


GAGCAGAGAG 


GTGGCCACCG 


GGGGTGGCTC 


1020 


60 


TGTCCTACCT 


CCACTCTCTG 


CCCCTAAAGA 


TGGGAGGAGA 


CCAGCGGTCC 


ATGGGTCTGG 


1080 




CCTGTGAGTC 


TCCCCTTGCA 


GCCTGGTCAC 


TAGGCATCAC 


CCCCGCTTTG 


GTTCTTCAGA 


1140 
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TGCTCTTGGG GTTCATAGGG GCAGGTCCTA GTCGGGCAGG GCCCCTGACC CTCCCGGCCT 1200 

GGCTTCACTC TCCCTGACGG CTGCCATTGG TCCACCCTTT CATAGAGAGG CCTGCTTTGT 12 60 

5 

TACAAAGCTC GGGTCTCCCT CCTGCAGCTC GGTTAAGTAC CCGAGGCCTC TCTTAAGATG 1320 

TCCAGGGCCC CAGGCCCGCG GGCACAGCCA GCCCAAACCT TGGGCCCTGG AAGAGTCCTC 1380 

10 CACCCCATCA CTAGAGTGCT CTGACCCTGG GCTTTCACGG GCCCCATTCC ACCGCCTCCC 1440 

CAACTTGAGC CTGTGACCTT GGGACCAAAG GGGGAGTCCC TCGTCTCTTG TGACTCAGCA 1500 

GAGGCAGTGG CCACGTTCAG GGAGGGGCCG GCTGGCCTGG AGGCTCAGCC CACCCTCCAG 15 60 

15 

CTTTTCCTCA GGGTGTCCTG AGGTCCAAGA TTCTGGAGCA ATCTGACCCT TCTCCAAAGG 1620 

CTCTGTTATC AGCTGGGCAG TGCCAGCCAA TCCCTGGCCA TTTGGCCCCA GGGGACGTGG 1680 
20 GCCCTG 



(2) INFORMATION FOR SEQ ID NO: 36: 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1843 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



30 



35 



(ii) MOLECULE TYPE: other nucleic acid (Contig 2535) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 



1686 



GTCTTTTACT TTGGCAATGG CTGGATTCCT ACCCTCATCA CGGCCTTTGT CCTTGCTACC 60 

TCTCAGGCCC AAGCTGGATG GCTGCAACAT GATTATGGCC ACCTGTCTGT CTACAGAAAA 120 

40 CCCAAGTGGA ACCACCTTGT CCACAAATTC GTCATTGGCC ACTTAAAGGG TGCCTCTGCC 180 

AACTGGTGGA ATCATCGCCA CTTCCAGCAC CACGCCAAGC CTAACATCTT CCACAAGGAT 2 40 

CCCGATGTGA ACATGCTGCA CGTGTTTGTT CTGGGCGAAT GGCAGCCCAT CGAGTACGGC 300 

45 

AAGAAGAAGC TGAAATACCT GCCCTACAAT CACCAGCACG AATACTTCTT CCTGATTGGG 360 

CCGCCGCTGC TCATCCCCAT GTATTTCCAG TACCAGATCA TCATGACCAT GATCGTCCAT 420 

50 AAGAACTGGG TGGACCTGGC CTGGGCCGTC AGCTACTACA TCCGGTTCTT CATCACCTAC 4 80 

ATCCCTTTCT ACGGCATCCT GGGAGCCCTC CTTTTCCTCA ACTTCATCAG GTTCCTGGAG 54 0 

^ AGCCACTGGT TTGTGTGGGT CACACAGATG AATCACATCG TCATGGAGAT TGACCAGGAG 600 

GCCTACCGTG ACTGGTTCAG TAGCCAGCTG ACAGCCACCT GCAACGTGGA GCAGTCCTTC 660 

TTCAACGACT GGTTCAGTGG ACACCTTAAC TTCCAGATTG AGCACCACCT CTTCCCCACC 720 

60 ATGCCCCGGC ACAACTTACA CAAGATCGCC CCGCTGGTGA AGTCTCTATG TGCCAAGCAT 780 

GGCATTGAAT ACCAGGAGAA GCCGCTACTG AGGGCCCTGC TGGACATCAT CAGGTCCCTG 84 0 
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AAGAAGTCTG GGAAGCTGTG GCTGGACGCC TACCTTCACA AATGAAGCCA CAGCCCCCGG 
GACACCGTGG GGAAGGGGTG CAGGTGGGGT GATGGCCAGA GGAATGATGG GCTTTTGTTC 
TGAGGGGTGT CCGAGAGGCT GGTGTATGCA CTGCTCACGG ACCCCATGTT GGATCTTTCT 
CCCTTTCTCC TCTCCTTTTT CTCTTCACAT CTCCCCCATA GCACCCTGCC CTCATGGGAC 
CTGCCCTCCC TCAGCCGTCA GCCATCAGCC ATGGCCCTCC CAGTGCCTCC TAGCCCCTTC 
TTCCAAGGAG CAGAGAGGTG GCCACCGGGG GTGGCTCTGT CCTACCTCCA CTCTCTGCCC 
CTAAAGATGG GAGGAGACCA GCGGTCCATG GGTCTGGCCT GTGAGTCTCC CCTTGCAGCC 
TGGTCACTAG GCATCACCCC CGCTTTGGTT CTTCAGATGC TCTTGGGGTT CATAGGGGCA 
GGTCCTAGTC GGGCAGGGCC CCTGACCCTC CCGGCCTGGC TTCACTCTCC CTGACGGCTG 
CCATTGGTCC ACCCTTTCAT AGAGAGGCCT GCTTTGTTAC AAAGCTCGGG TCTCCCTCCT 
GCAGCTCGGT TAAGTACCCG AGGCCTCTCT TAAGATGTCC AGGGCCCCAG GCCCGCGGGC 
ACAGCCAGCC CAAACCTTGG GCCCTGGAAG AGTCCTCCAC CCCATCACTA GAGTGCTCTG 
ACCCTGGGCT TTCACGGGCC CCATTCCACC GCCTCCCCAA CTTGAGCCTG TGACCTTGGG 
ACCAAAGGGG GAGTCCCTCG TCTCTTGTGA CTCAGCAGAG GCAGTGGCCA CGTTCAGGGA 
GGGGCCGGCT GGCCTGGAGG CTCAGCCCAC CCTCCAGCTT TTCCTCAGGG TGTCCTGAGG 
TCCAAGATTC TGGAGCAATC TGACCCTTCT CCAAAGGCTC TGTTATCAGC TGGGCAGTGC 
CAGCCAATCC CTGGCCATTT GGCCCCAGGG GACGTGGGCC CTG 

-(2) INFORMATION FOR SEQ ID NO:37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2257 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid (Edited Contig 253538a) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

CAGGGACCTA CCCCGCGCTA CTTCACCTGG GACGAGGTGG CCCAGCGCTC AGGGTGCGAG 60 

GAGCGGTGGC TAGTGATCGA CCGTAAGGTG TACAACATCA GCGAGTTCAC CCGCCGGCAT 120 

CCAGGGGGCT CCCGGGTCAT CAGCCACTAC GCCGGGCAGG ATGCCACGGA TCCCTTTGTG 180 

GCCTTCCACA TCAACAAGGG CCTTGTGAAG AAGTATATGA ACTCTCTCCT GATTGGAGAA 240 

CTGTCTCCAG AGCAGCCCAG CTTTGAGCCC ACCAAGAATA AAGAGCTGAC AGATGAGTTC 300 

CGGGAGCTGC GGGCCACAGT GGAGCGGATG GGGCTCATGA AGGCCAACCA TGTCTTCTTC 360 

CTGCTGTACC TGCTGCACAT CTTGCTGCTG GATGGTGCAG CCTGGCTCAC CCTTTGGGTC 420 
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TTTGGGACGT 


CCTTTTTGCC 


GTTCCTCCTC 


TGTGCGGTGC 


TGCTCAGTGC 


AGTTCAGCAG 


480 




GCCCAAGCTG 


GATGGCTGCA 


ACATGATTAT 


GGCCACCTGT 


CTGTCTACAG 


AAAACCCAAG 


540 


5 


TGGAACCACC 


TTGTCCACAA 


ATTCGTCATT 


GGCCACTTAA 


AGGGTGCCTC 


TGCCAACTGG 


600 




TGGAATCATC 


GCCACTTCCA 


GCACCACGCC 


AAGCCTAACA 


TCTTCCACAA 


GG AT CCCGAT 


660 


10 


GTGAACATGC 


TGCACGTGTT 


TGTTCTGGGC 


GAATGGCAGC 


CCATCGAGTA 


CGGCAAGAAG 


720 


AAGCTGAAAT 


ACCTGCCCTA 


CAATCACCAG 


CACGAATACT 


TCTTCCTGAT 


TGGGCCGCCG 


780 




CTGCTCATCC 


CCATGTATTT 


CCAGTACCAG 


ATCATCATGA 


CCATGATCGT 


CCATAAGAAC 


840 


15 


TGGGTGGACC 


TGGCCTGGGC 


CGTCAGCTAC 


TACATCCGGT 


TCTTCATCAC 


CTACATCCCT 


900 




TTCTACGGCA 


TCCTGGGAGC 


CCTCCTTTTC 


CTCAACTTCA 


TCAGGTTCCT 


GGAGAGCCAC 


960 


20 


TGGTTTGTGT 


GGGTCACACA 


GATGAATCAC 


ATCGTCATGG 


AGATTGACCA 


GGAGGCCTAC 


1020 




CGTGACTGGT 


TCAGTAGCCA 


GCTGACAGCC 


ACCTGCAACG 


TGGAGCAGTC 


CTTCTTCAAC 


1080 




GACTGGTTCA 


GTGGACACCT 


TAACTTCCAG 


ATTGAGCACC 


ACCTCTTCCC 


CACCATGCCC 


1140 


25 


CGGCACAACT 


TACACAAGAT 


CGCCCCGCTG 


GTGAAGTCTC 


TATGTGCCAA 


GCATGGCATT 


1200 




GAATACCAGG 


AGAAGCCGCT 


ACTGAGGGCC 


CTGCTGGACA 


TCATCAGGTC 


CCTGAAGAAG 


1260 


30 


TCTGGGAAGC 


TGTGGCTGGA 


CGCCTACCTT 


CACAAATGAA 


GCCACAGCCC 


CCGGGACACC 


1320 




GTGGGGAAGG 


GGTGCAGGTG 


GGGTGATGGC 


CAGAGGAATG 


ATGGGCTTTT 


GTTCTGAGGG 


1380 




GTGTCCGAGA 


GGCTGGTGTA 


TGCACTGCTC 


ACGGACCCCA 


TGTTGGATCT 


TTCTCCCTTT 


1440 


35 


CTCCTCTCCT 


TTTTCTCTTC 


ACATCTCCCC 


CATAGCACCC 


TGCCCTCATG 


GGACCTGCCC 


1500 




TCCCTCAGCC 


GTCAGCCATC 


AGCCATGGCC 


CTCCCAGTGC 


CTCCTAGCCC 


CTTCTTCCAA 


1560 


40 


GGAGCAGAGA 


GGTGGCCACC 


GGGGGTGGCT 


CTGTCCTACC 


TCCACTCTCT 


GCCCCTAAAG 


1620 


ATGGGAGGAG 


ACCAGCGGTC 


CATGGGTCTG 


GCCTGTGAGT 


CTCCCCTTGC 


AGCCTGGTCA 


1680 




CTAGGCATCA 


CCCCCGCTTT 


GGTTCTTCAG 


ATGCTCTTGG 


GGTTCATAGG 


GGCAGGTCCT 


1740 


45 


AGTCGGGCAG 


GGCCCCTGAC 


CCTCCCGGCC 


TGGCTTCACT 


CTCCCTGACG 


GGTGCCATTG 


1800 




GTCCACCCTT 


TCATAGAGAG 


GCCTGCTTTG 


TTACAAAGCT 


CGGGTCTCCC 


TCCTGCAGCT 


1860 


50 


CGGTTAAGTA 


CCCGAGGCCT 


CTCTTAAGAT 


GTCCAGGGCC 


CCAGGCCCGC 


GGGCACAGCC 


1920 


AGCCCAAACC 


TTGGGCCCTG 


GAAGAGTCCT 


CCACCCCATC 


ACTAGAGTGC 


TCTGACCCTG 


1980 




GGCTTTCACG 


GGCCCCATTC 


CACCGCCTCC 


CCAACTTGAG 


CCTGTGACCT 


TGGGACCAAA 


2040 


55 


GGGGGAGTCC 


CTCGTCTCTT 


GTGACTCAGC 


AGAGGCAGTG 


GCCACGTTCA 


GGGAGGGGCC 


2100 




GGCTGGCCTG 


GAGGCTCAGC 


CCACCCTCCA 


GCTTTTCCTC 


AGGGTGTCCT 


GAGGTCCAAG 


2160 


60 


ATTCTGGAGC 


AATCTGACCC 


TTCTCCAAAG 


GCTCTGTTAT 


CAGCTGGGCA 


GTGCCAGCCA 


2220 


ATCCCTGGCC 


ATTTGGCCCC 


AGGGGACGTG 


GGCCCTG 






2257 
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(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 411 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

10 (ii) MOLECULE TYPE: amino acid (Translation of Contig 2692004) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:38: 

15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



His 


Ala 


ASp 


Arg Arg 


Arg 


Cj-LU 


T 1 a 

xxe 


Leu 


Ala 


Lys 


Tyr 


Pro 


Glu 


xxe 


1 








5 










10 












Lys 


Ser 


Leu 


Met 


Lys 


Pro 


Asp 


Pro 


Asn 


Leu 


x±e 


Trp 


He 


He 


T 1 a 

ixe 










20 










25 










30 


Met 


Met 


vax 


Leu 


Thr 


Gin 


Leu 


uxy 


Ala 


Phe 


Tyr 


He 


Val 


Lys 


Asp 










35 










40 










45 


Leu 


Asp 


Trp 


Lys 


Trp 


Val 


lie 


rne 


Gly Ala 


Tyr 


Ala 


Phe 


Gly 


Ser 










50 




















60 


Cys 


He 


7\ c!> rS 


rilS 


Ser 


Met 


inr 


Leu 


Ala 


T 1 Q 

xxe 


HIS 


Glu 


He 


Ala 


His 










65 










/ U 










75 


Asn 


Ala 


TV 1 -a 


fne 


Gly Asn 


Cys 


Lys 


Ala 


Met 


Trp 


Asn 


Arg 


Trp 


Phe 










80 










oD 










90 


Gly Met 


irne 


Hid 


Asn 


Leu 


Pro 


xxe 


Gly 


Tl _ 

lie 


Pro 


Tyr 


Ser 


He 


Ser 










95 










xUu 










105 


Phe 


Lys 


Arg 


Tyr 


His 


Met 


Asp 


nlS 


His 


Arg 


Tyr 


Leu Gly Ala 


Asp 










110 










115 










120 


Gly Val 


Asp 


vax 


Asp 


He 


Pro 


Thr 


Asp 


Phe 


Glu 


Gly 


Trp 


Phe 


Phe 










125 










130 










135 


Cys 


Thr 


nid 


Fne 


Arg 


Lys 


irne 


He 


Trp 


Val 


He 


Leu 


Gin 


Pro 


Leu 










140 










145 










150 


Phe 


Tyr 




pne 


Arg 


Pro 


Leu 


Phe 


He 


Asn 


Pro 


Lys 


Pro 


He 


Thr 










155 










160 










165 


Tyr 


Leu 


Pin 

IjXU 


vai 


He 


Asn 


inr 


Val 


Ala 


Gin 


Val 


Thr 


Phe 


Asp 


He 










170 










175 










180 


Leu 


He 


Tyr 


Tyr 


Phe 


Leu 


C 1 \r 

oiy 


He 


Lys 


Ser 


Leu 


Val 


Tyr 


Met 


Leu 










185 










190 










195 


Ala 


Ala 


Ser 


Leu 


Leu Gly 


Leu 


Gly 


Leu 


His 


Pro 


He 


Ser 


Gly His 










200 










205 










210 


Phe 


He 


Ala 


Glu 


His 


Tyr 


Met 


Phe 


Leu 


Lys 


Gly 


His 


Glu 


Thr 


Tyr 










215 










220 










225 


Ser 


Tyr 


Tyr 


Gly 


Pro 


Leu 


Asn 


Leu 


Leu 


Thr 


Phe 


Asn 


Val 


Gly 


Tyr 










230 










235 










240 


His 


Asn 


Glu 


His 


His 


Asp 


Phe 


Pro 


Asn 


He 


Pro 


Gly 


Lys 


Ser 


Leu 










245 










250 










255 


Pro 


Leu 


Val 


Arg 


Lys 


He 


Ala 


Ala 


Glu 


Tyr 


Tyr Asp 


Asn 


Leu 


Pro 










260 










265 










270 


His 


Tyr 


Asn 


Ser 


Trp 


He 


Lys 


Val 


Leu 


Tyr 


Asp 


Phe 


Val 


Met 


Asp 










275 










280 










285 


Asp 


Thr 


He 


Ser 


Pro 


Tyr 


Ser 


Arg Met 


Lys 


Arg 


His 


Gin 


Lys 


Gly 










290 










295 










300 


Glu 


Met 


Val 


Leu 


Glu 


* * * 


He 


Ser 


Leu 


Val 


Pro 


Lys 


Gly 


Phe 


Phe 










305 










310 








315 


Ser 


Lys 


Thr 


Leu 


Asp 


Asp 


Lys 


Met 


Glu 


Phe 


Leu 


His 


Tyr 


★ * + 


Thr 










320 










325 








330 


* * * 


Asp 


Gin 


*★ + 


Cys 


Ser 


Glu 


Ala 


Pro 


Leu 


Ala 


Gin 


Phe 


Gin 


Ser 










335 










340 










345 


Lys 


Ser 


Ser 


Val 


He 


Pro 


Arg 


Ser 


Glu 


Ser 


Gly 


Phe 


* * + 


Thr 


Val 










350 










355 










360 
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Gly Asn Leu *** 
375 

His Val Cys His 
390 

Ser Tyr Phe Ala 
410 



(2) INFORMATION FOR SEQ ID NO: 39: 

10 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 218 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
15 (D) TOPOLOGY: linear 



Ser Leu Thr Leu Tyr Cys Ser Val Ser Leu Thr 
365 370 

Leu Val Tyr Tyr Arg His *** Gly Cys Phe Thr 
380 ' 385 

Phe He Ser He Ser Phe Lys Lys Leu Leu Lys 
400 405 

Arg 



20 



(ii) MOLECULE TYPE: amino acid (Translation of Contig 2153526) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 





Tyr 


Leu 


Leu 


Arg 


Pro 


Leu 


Leu 


Pro 


His 


Leu 


Cys 


Ala Thr 


He 


Gly 




1 








5 










10 








15 


25 


Ala 


Glu 


Ser 


Phe 


Leu 


Gly 


Leu 


Phe 


Phe 


He 


Val 


Arg Phe 


Leu 


Glu 










20 










25 






30 




Ser 


Asn 


Trp 


Phe 


Val 
35 


Trp 


Val 


Thr 


Gin 


Met 
40 


Asn 


His He 


Pro 


Met 
45 




His 


He 


Asp 


His 


Asp 


Arg 


Asn 


Met 


Asp 


Trp 


Val 


Ser Thr 


Gin 


Leu 


30 










50 










55 








60 


Gin 


Ala 


Thr 


Cys 


Asn 
65 


Val 


His 


Lys 


Ser 


Ala 
70 


Phe 


Asn Asp 


Trp 


Phe 
75 




Ser 


Gly 


His 


Leu 


Asn 
80 


Phe 


Gin 


He 


Glu 


His 
85 


His 


Leu Phe 


Pro 


Thr 
90 


35 


Met 


Pro 


Arg 


His 


Asn 


Tyr 


His 


Lys 


Val 


Ala 


Pro 


Leu Val 


Gin 


Ser 










95 










100 








105 




Leu 


Cys 


Ala 


Lys 


His 


Gly 


He 


Glu 


Tyr 


Gin 


Ser 


Lys Pro 


Leu 


Leu 












110 










115 








120 




Ser 


Ala 


Phe 


Ala 


Asp 


He 


He 


His 


Ser 


Leu 


Lys 


Glu Ser 


Gly 


Gin 


40 










125 










130 






135 


Leu 


Trp 


Leu 


Asp 


Ala 
140 


Tyr 


Leu 


His 


Gin 


* * + 
145 


Gin 


Gin Pro 


Pro 


Cys 
150 




Pro 


Val 


Trp 


Lys 


Lys 


Arg 


Arg 


Lys 


Thr 


Leu 


Glu 


Pro Arg 


Gin 


Arg 












155 










160 






165 


45 


Gly Ala 


* * * 


Gly 


Thr 


Met 


Pro 


Leu 


* * * 


Phe 


Asn 


Thr Gin 


Arg 


Gly 










170 










175 






180 




Leu 


Gly 


Leu 


Gly 


Thr 


★ * * 


Ser 


Leu 


*** 


Leu 


Lys 


Leu Leu 


Pro 


Phe 












185 










190 






195 




He 


Phe 


* ★ * 


Pro 


Gin 


Phe 


+ * + 


Asp 


Pro 


Lys 


Trp 


Gly Val 


Asp 


Thr 


50 










200 










205 








210 


Glu 


Val 


Pro 


Arg 


Arg 
215 


Glu 


Gly 


Ala 















55 (2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 71 amino acids 

(B) TYPE: amino acid 

60 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: amino acid (Translation of Contig 3506132) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

5 



v a -L 


Phe 


Tyr 


Phe 




Asn 


y 


T r~r\ T1 d 

l irp lie 


Pro 


inr 


Leu 


He 


j. nr 


7\ "I -a 


1 








5 








10 










15 


Phe 


Val 


Leu 


Ala 


Thr 


Ser 


Gin 


Ala Gin 


Ala 


Gly 


Trp 


Leu 


Gin 


His 










20 








25 










30 


Asp 


Tyr 


Gly 


His 


Leu 


Ser 


Val 


Tyr Arg 


Lys 


Pro 


Lys 


Trp 


Asn 


His 










35 








40 










45 


Leu 


Val 


His 


Lys 


Phe 


Val 


He 


Gly His 


Leu 


Lys 


Gly Ala 


Ser 


Ala 










50 








55 










60 


Asn 


Trp 


Trp 


Asn 


His 


Arg 


His 


Phe Gin 


His 


His 


Ala 


Lys 


Pro 


Asn 










65 








70 










75 


Leu 


Gly 


Glu 


Trp 


Gin 


Pro 


He 


Glu Tyr 


Gly 


Lys 


Xxx 
















80 








85 












(2) 


INFORMATION 


FOR 


SEQ 


ID 


NO: 41: 















(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 306 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

30 (ii) MOLECULE TYPE: amino acid (Translation of Contig 3854933) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 



35 



40 



45 



50 



55 



60 



Gin Gly 


Pro 


Thr 


Pro 


Arg 


Tyr 


Phe 


Thr 


Trp 


Asp 


Glu 


Val 


Ala 


Gin 


1 








5 










10 










15 


Arg 


Ser 


Gly 


Cys 


Glu 
20 


Glu 


Arg 


Trp 


Leu 


Val 
25 


He 


Asp 


Arg 


Lys 


Val 
30 


Tyr 


Asn 


He 


Ser 


Glu 


Phe 


Thr 


Arg 


Arg 


His 


Pro 


Gly Gly 


Ser 


Arg 










35 










40 










45 


Val 


He 


Ser 


His 


Tyr 
50 


Ala 


Gly 


Gin 


Asp 


Ala 
55 


Thr 


Asp 


Pro 


Phe 


Val 
60 


Ala 


Phe 


His 


He 


Asn 
65 


Lys 


Gly 


Leu 


Val 


Lys 
70 


Lys 


Tyr 


Met 


Asn 


Ser 
75 


Leu 


Leu 


He 


Gly 


Glu 
80 


Leu 


Ser 


Pro 


Glu 


Gin 
85 


Pro 


Ser 


Phe 


Glu 


Pro 
90 


Thr 


Lys 


Asn 


Lys 


Glu 
95 


Leu 


Thr 


Asp 


Glu 


Phe 
100 


Arg 


Glu 


Leu 


Arg 


Ala 
105 


Thr 


Val 


Glu 


Arg 


Met 
110 


Gly 


Leu 


Met 


Lys 


Ala 
115 


Asn 


His 


Val 


Phe 


Phe 
120 


Leu 


Leu 


Tyr 


Leu 


Leu 


His 


He 


Leu 


Leu 


Leu 


Asp 


Gly Ala 


Ala 


Trp 










125 










130 










135 


Leu 


Thr 


Leu 


Trp 


Val 
140 


Phe 


Gly 


Thr 


Ser 


Phe 
145 


Leu 


Pro 


Phe 


Leu 


Leu 
150 


Cys 


Ala 


Val 


Leu 


Leu 


Ser 


Ala 


Val 


Gin 


Ala 


Gin 


Ala 


Gly 


Trp 


Leu 










155 










160 






165 


Gin 


His 


Asp 


Phe 


Gly 


His 


Leu 


Ser 


Val 


Phe 


Ser 


Thr 


Ser 


Lys 


Trp 










170 










175 








180 


Asn 


His 


Leu 


Leu 


His 


His 


Phe 


Val 


lie 


Gly 


His 


Leu 


Lys 


Gly 


Ala 










185 










190 








195 


Pro 


Ala 


Ser 


Trp 


Trp 
200 


Asn 


His 


Met 


His 


Phe 
205 


Gin 


His 


His 


Ala 


Lys 
210 
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Pro 


Asn 


Cys 


Phe 


Arg 
215 


Lys 


Asp 


Pro 


Asp 


He 
220 


Asn 


Met 


His 


Pro 


Phe 
225 




Phe 


Phe 


Ala 


Leu 


Gly 
230 


Lys 


He 


Leu 


Ser 


Val 
235 


Glu 


Leu 


Gly 


Lys 


Gin 
240 


5 


Lys 


Lys 


Lys 


Tyr 


Met 
245 


Pro 


Tyr 


Asn 


His 


Gin 
250 


His 


Xxx 


Tyr 


Phe 


Phe 
255 




Leu 


He 


Gly 


Pro 


Pro 
260 


Ala 


Leu 


Leu 


Pro 


Leu 
265 


Tyr 


Phe 


Gin 


Trp 


Tyr 
270 


10 


He 


Phe 


Tyr 


Phe 


Val 


He 


Gin 


Arg 


Lys 


Lys 


Trp Val 


Asp 


Leu 


Ala 










275 










280 










285 




Trp 


He 


Ser 


Lys 


Gin 
290 


Glu 


Tyr 


Asp 


Glu 


Ala 
295 


Gly 


Leu 


Pro 


Leu 


Ser 
300 




Thr 


Ala 


Asn 


Ala 


Ser 
305 


Lys 





















15 



(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 566 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
-(D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: amino acid (Translation of Contig 2511785) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 



30 


His 
1 


Leu 


Lys 


Gly 


Ala 
5 


Ser 




Gin 


His 


His 


Ala 


Lys 
20 


Pro 




Asn 


Met 


Leu 


His 


Val 


Phe 


35 










35 






Tyr 


Gly 


Lys 


Lys 


Lys 
50 


Leu 




Glu 


Tyr 


Phe 


Phe 


Leu 


He 


40 










65 




Phe 


Gin 


Tyr 


Gin 


He 
80 


He 




Val 


Asp 


Leu 


Ala 


Trp 
95 


Ala 


45 


Thr 


Tyr 


He 


Pro 


Phe 


Tyr 










110 




Asn 


Phe 


He 


Arg 


Phe 
125 


Leu 




Gin 


Met 


Asn 


His 


He 


Val 


50 










140 




Asp 


Trp 


Phe 


Ser 


Ser 
155 


Gin 




Ser 


Phe 


Phe 


Asn 


Asp 
170 


Trp 




Glu 


His 


His 


Leu 


Phe 


Pro 


55 










185 






He 


Ala 


Pro 


Leu 


Val 
200 


Lys 




Tyr 


Gin 


Glu 


Lys 


Pro 


Leu 


60 










215 




Ser 


Leu 


Lys 


Lys 


Ser 
230 


Gly 




Lys 


+ + * 


Ser 


His 


Ser 


Pro 



Ala 


Asn 


Trp 


Trp 


Asn 


His 


Arg 


His 


Phe 








10 










15 


Asn 


He 


Phe 


His 


Lys 


Asp 


Pro 


Asp 


Val 








25 










30 


Val 


Leu 


Gly Glu 


Trp 


Gin 


Pro 


He 


Glu 








40 










45 


Lys 


Tyr 


Leu 


Pro 


Tyr 


Asn 


His 


Gin 


His 








55 










60 


Gly 


Pro 


Pro 


Leu 


Leu 


He 


Pro 


Met 


Tyr 








70 










75 


Met 


Thr 


Met 


He 


Val 


His 


Lys 


Asn 


Trp 








85 










90 


Val 


Ser 


Tyr 


Tyr 


He 


Arg 


Phe 


Phe 


He 








100 










105 


Gly 


He 


Leu 


Gly Ala 


Leu 


Leu 


Phe 


Leu 








115 










120 


Glu 


Ser 


His 


Trp 


Phe 


Val 


Trp 


Val 


Thr 








130 










135 


Met 


Glu 


He Asp 


Gin 


Glu 


Ala 


Tyr 


Arg 








145 










150 


Leu 


Thr 


Ala 


Thr 


Cys 


Asn 


Val 


Glu 


Gin 








160 










165 


Phe 


Ser 


Gly 


His 


Leu 


Asn 


Phe 


Gin 


He 








175 










180 


Thr 


Met 


Pro 


Arg 


His 


Asn 


Leu 


His 


Lys 








190 










195 


Ser 


Leu 


Cys 


Ala 


Lys 


His 


Gly 


He 


Glu 








205 










210 


Leu 


Arg 


Ala 


Leu 


Leu 


Asp 


He 


He 


Arg 








220 










225 


Lys 


Leu 


Trp 


Leu 


Asp 


Ala 


Tyr 


Leu 


His 








235 










240 


Arg Asp 


Thr 


Val 


Gly 


Lys 


Gly Cys 


Arg 
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10 



15 



20 



25 



30 



35 



40 



45 



55 



60 











245 










250 










255 


Trp 


Gly Asp 




bin 


Arg Asn Asp Gly 


Leu 


Leu 


Phe 


* * * 


Gly 


Val 




















265 








270 




GlU 


Arg 


Leu 


vai 

9 "7 ^ 


Tyr 


Ala 


Leu 


Leu 


Thr 
280 


Asp 


Pro 


Met 


Leu 


Asp 
285 


Leu 


Ser 


Pro 


true 


Leu 
z y u 


Leu 


Ser 


Phe 


Phe 


Ser 
295 


Ser 


His 


Leu 


Pro 


His 
300 


Ser 


Thr 


Leu 


Pro 


Ser 


Trp 


Asp 


Leu 


Pro 


Ser 


Leu 


Ser 


Arg 


Gin 


Pro 




















310 








315 


ber 


Ala 


Met 


Ala 


Leu 

U 


Pro 


Val 


Pro 


Pro 


Ser 
325 


Pro 


Phe 


Phe 


Gin 


Gly 
330 




Glu 


Arg 


Trp 


Pro 


Pro 


Gly Val 


Ala 


Leu 


Ser 


Tyr 


Leu 


His 


Ser 




















340 








345 


Leu 


Pro 


Leu 


Lys 


Met 


Gly Gly Asp Gin 


Arg 


Ser 


Met 


Gly 


Leu 


Ala 




















355 








360 


Cys 


Glu 


Ser 


Pro 


Leu 


Ala 


Ala 


Trp 


Ser 


Leu 


Gly 


He 


Thr 


Pro 


Ala 










365 










370 








375 


Leu 


Val 


Leu 


vain 


Met 


Leu 


Leu Gly Phe 


He 


Gly 


Ala 


Gly 


Pro 


Ser 










380 










385 








390 


Arg 


Ala Gly 


fro 


Leu 


Thr 


Leu 


Pro 


Ala 


Trp 


Leu 


His 


Ser 


Pro 


+ + + 










400 










405 










410 


Arg 


Leu 


Pro 


T Alt 

Leu 


val 


His 


Pro 


Phe 


He 


Glu 


Arg 


Pro 


Ala 


Leu 


Leu 




















420 








425 


bill 


Ser 


Ser 


uiy 


Leu 


Pro 


Pro 


Ala 


Ala 


Arg 


Leu 


Ser 


Thr 


Arg 


Gly 










H O \J 










A *3 C 

435 








440 


Leu 


Ser 


* + * 


Asp 


vai 


Gin 


Gly 


Pro 


Arg 


Pro 
450 


Ala 


Gly 


Thr 


Ala 


Ser 
455 


Pro 


Asn 


Leu 


C 1 XT 

uxy 


Pro 


Trp 


Lys 


Ser 


Pro 


Pro 
465 


Pro 


His 


His 


* * * 


Ser 
470 


rl-L a 


Leu 


Thr 


Leu 


tiiy 

4 / O 


Phe 


His 


Gly 


Pro 


His 
480 


Ser 


Thr 


Ala 


Ser 


Pro 
485 


i nr 




Ala 


Cys 


ASp 


Leu 


Gly 


Thr 


Lys 


Gly 


Gly 


Val 


Pro 


Arg 


Leu 










490 










495 








500 


Leu 


* ** 


Leu 


Ser 


Arg 


Gly 


Ser 


Gly His 


Val 


Gin 


Gly Gly 


Ala 


Gly 










505 










510 










515 


Trp 


Pro 


Gly 


Gly 


Ser 


Ala 


His 


Pro 


Pro 


Ala 


Phe 


Pro 


Gin 


Gly 


Val 










520 










525 








530 


Leu 


Arg 


Ser 


Lys 


He 


Leu 


Glu 


Gin 


Ser 


Asp 


Pro 


Ser 


Pro 


Lys 


Ala 










535 










540 








545 


Leu 


Leu 


Ser 


Ala 


Gly 


Gin 


Cys 


Gin 


Pro 


He 


Pro 


Gly 


His 


Leu 


Ala 










550 










555 








560 


Pro 


Gly 


Asp 


Val 


Gly 
565 


Pro 


Xxx 



















(2) INFORMATION FOR SEQ ID NO: 43: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 619 amino acids 
50 (B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: amino acid (Translation of Contig 2535) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 



Val Phe Tyr Phe Gly Asn Gly Trp He Pro Thr Leu He Thr Ala 
1 5 10 15 

Phe Val Leu Ala Thr Ser Gin Ala Gin Ala Gly Trp Leu Gin His 
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20 25 30 

Asp Tyr Gly His Leu Ser Val Tyr Arg Lys Pro Lys Trp Asn His 
35 40 45 

Leu Val His Lys Phe Val lie Gly His Leu Lys Gly Ala Ser Ala 
5 50 55 60 

Asn Trp Trp Asn His Arg His Phe Gin His His Ala Lys Pro Asn 
65 70 75 

lie Phe His Lys Asp Pro Asp Val Asn Met Leu His Val Phe Val 
80 85 90 

10 Leu Gly Glu Trp Gin Pro lie Glu Tyr Gly Lys Lys Lys Leu Lys 

95 100 105 

Tyr Leu Pro Tyr Asn His Gin His Glu Tyr Phe Phe Leu lie Gly 
110 115 120 

Pro Pro Leu Leu He Pro Met Tyr Phe Gin Tyr Gin He He Met 
15 125 130 135 

Thr Met He Val His Lys Asn Trp Val Asp Leu Ala Trp Ala Val 
140 145 150 

Ser Tyr Tyr He Arg Phe Phe He Thr Tyr He Pro Phe Tyr Gly 
155 160 165 

20 He Leu Gly Ala Leu Leu Phe Leu Asn Phe He Arg Phe Leu Glu 

170 175 180 

Ser His Trp Phe Val Trp Val Thr Gin Met Asn His He Val Met 
185 190 195 

Glu He Asp Gin Glu Ala Tyr Arg Asp Trp Phe Ser Ser Gin Leu 
25 200 205 210 

Thr Ala Thr Cys Asn Val Glu Gin Ser Phe Phe Asn Asp Trp Phe 
215 220 225 

Ser Gly His Leu Asn Phe Gin He Glu His His Leu Phe Pro Thr 
230 235 240 

30 Met Pro Arg His Asn Leu His Lys He Ala Pro Leu Val Lys Ser 

245 250 255 

Leu Cys Ala Lys His Gly He Glu Tyr Gin Glu Lys Pro Leu Leu 
260 265 270 

Arg Ala Leu Leu Asp He He Arg Ser Leu Lys Lys Ser Gly Lys 
35 275 280 285 

Leu Trp Leu Asp Ala Tyr Leu His Lys *** Ser His Ser Pro Arg 
290 295 300 

Asp Thr Val Gly Lys Gly Cys Arg Trp Gly Asp Gly Gin Arg Asn 
305 310 315 

40 Asp Gly Leu Leu Phe *** Gly Val Ser Glu Arg Leu Val Tyr Ala 

320 325 330 

Leu Leu Thr Asp Pro Met Leu Asp Leu Ser Pro Phe Leu Leu Ser 
335 340 345 

Phe Phe Ser Ser His Leu Pro His Ser Thr Leu Pro Ser Trp Asp 
45 350 355 360 

Leu Pro Ser Leu Ser Arg Gin Pro Ser Ala Met Ala Leu Pro Val 
365 370 375 

Pro Pro Ser Pro Phe Phe Gin Gly Ala Glu Arg Trp Pro Pro Gly 
380 385 390 

50 Val Ala Leu Ser Tyr Leu His Ser Leu Pro Leu Lys Met Gly Gly 

400 405 410 

Asp Gin Arg Ser Met Gly Leu Ala Cys Glu Ser Pro Leu Ala Ala 
415 420 425 

Trp Ser Leu Gly He Thr Pro Ala Leu Val Leu Gin Met Leu Leu 
55 430 435 440 

Gly Phe He Gly Ala Gly Pro Ser Arg Ala Gly Pro Leu Thr Leu 
445 450 455 

Pro Ala Trp Leu His Ser Pro *** Arg Leu Pro Leu Val His Pro 
460 465 470 

60 Phe He Glu Arg Pro Ala Leu Leu Gin Ser Ser Gly Leu Pro Pro 

475 480 485 

Ala Ala Arg Leu Ser Thr Arg Gly Leu Ser *** Asp Val Gin Gly 
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490 








495 










500 


Pro Arg 


Pro 


Ala 


Gly 


Thr Ala 


Ser 


Pro 


Asn 


Leu 


Gly Pro 


Trp 


Lys 










C f\ Q 

DUO 








510 








515 


Ser 


Pro 


Pro 


Pro 


His 
520 


His *** 


Ser 


Ala 


Leu 
525 


Thr 


Leu 


Gly 


Phe 


His 
530 


Gly 


Pro 


His 


Ser 


Thr 
535 


Ala Ser 


Pro 


Thr 


* * * 
540 


Ala 


Cys 


Asp 


Leu 


Gly 
545 


Thr 


Lys 


Gly 


Gly 


val 


Pro Arg 


Leu 


Leu 


■k + -dr 


Leu 


Ser 


Arg 


Gly 


Ser 










550 








555 








560 


Gly 


His 


Val 


Gin 


Gly 
565 


Gly Ala 


Gly 


Trp 


Pro 
570 


Gly 


Gly 


Ser 


Ala 


His 
575 


Pro 


Pro 


Ala 


Phe 


Pro 


Gin Gly Val 


Leu 


Arg 


Ser 


Lys 


He 


Leu 


Glu 










580 








585 










590 


Gin 


Ser 


Asp 


Pro 


Ser 
595 


Pro Lys 


Ala 


Leu 


Leu 
600 


Ser 


Ala 


Gly 


Gin 


Cys 
605 


Gin 


Pro 


He 


Pro 


Gly 
610 


His Leu 


Ala 


Pro 


Gly 
615 


Asp 


Val 


Gly 


Pro 


Xxx 
620 



20 

(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 757 amino acids 
25 (B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: amino acid (Translation of Contig 253538a) 

30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 



35 



40 



45 



50 



55 



60 



Gin 


Gly 


Pro 


Thr 


Pro 


Arg 


Tyr 


Phe 


Thr 


Trp 


Asp 


Glu 


Val 


Ala 


Gin 


1 








5 










10 










15 


Arg 


Ser 


Gly 


Cys 


Glu 


Glu 


Arg 


Trp 


Leu 


Val 


He 


Asp 


Arg 


Lys 


Val 










20 










25 










30 


Tyr 


Asn 


He 


Ser 


Glu 


Phe 


Thr 


Arg 


Arg 


His 


Pro 


Gly 


Gly 


Ser 


Arg 










35 










40 










45 


Val 


He 


Ser 


His 


Tyr 


Ala 


Gly 


Gin 


Asp 


Ala 


Thr 


Asp 


Pro 


Phe 


Val 










50 










55 










60 


Ala 


Phe 


His 


He 


Asn 


Lys 


Gly 


Leu 


Val 


Lys 


Lys 


Tyr Met 


Asn 


Ser 










65 










70 










75 


Leu 


Leu 


He 


Gly 


Glu 


Leu 


Ser 


Pro 


Glu 


Gin 


Pro 


Ser 


Phe 


Glu 


Pro 










80 










85 










90 


Thr 


Lys 


Asn 


Lys 


Glu 


Leu 


Thr 


Asp 


Glu 


Phe 


Arg 


Glu 


Leu 


Arg 


Ala 










95 










100 








105 


Thr 


Val 


Glu 


Arg 


Met 


Gly 


Leu 


Met 


Lys 


Ala 


Asn 


His 


Val 


Phe 


Phe 










110 










115 










120 


Leu 


Leu 


Tyr 


Leu 


Leu 


His 


He 


Leu 


Leu 


Leu 


Asp 


Gly 


Ala 


Ala 


Trp 










125 










130 










135 


Leu 


Thr 


Leu 


Trp 


Val 


Phe 


Gly 


Thr 


Ser 


Phe 


Leu 


Pro 


Phe 


Leu 


Leu 










140 










145 










150 


Cys 


Ala 


Val 


Leu 


Leu 


Ser 


Ala 


Val 


Gin 


Gin 


Ala 


Gin 


Ala 


Gly 


Trp 










155 










160 








165 


Leu 


Gin 


His 


Asp 


Tyr 


Gly 


His 


Leu 


Ser 


Val 


Tyr 


Arg 


Lys 


Pro 


Lys 










170 










175 










180 


Trp 


Asn 


His 


Leu 


Val 


His 


Lys 


Phe 


Val 


He 


Gly 


His 


Leu 


Lys 


Gly 










185 










190 








195 


Ala 


Ser 


Ala 


Asn 


Trp 


Trp 


Asn 


His 


Arg 


His 


Phe 


Gin 


His 


His 


Ala 










200 










205 










210 


Lys 


Pro 


Asn 


He 


Phe 


His 


Lys 


Asp 


Pro 


Asp 


Val 


Asn 


Met 


Leu 


His 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 











215 








220 










225 


val 


Phe 


Val 


Leu 


Gly 


Glu 


Trp 


Gin 


Pro He 


Glu 


Tyr 


Gly 


Lys 


Lys 










230 








235 










240 


Lys 


Leu 


Lys 


Tyr 


Leu 


Pro 


Tyr 


Asn 


His Gin 


His 


Glu 


Tyr 


Phe 


Phe 










245 








250 










255 


Leu 


He 


Gly 


Pro 


Pro 


Leu 


Leu 


He 


Pro Met 


Tyr 


Phe 


Gin 


Tyr 


Gin 










260 








265 










270 


He 


He 


Met 


Thr 


Met 


He 


Val 


His 


Lys Asn 


Trp 


Val 


Asp 


Leu 


Ala 










275 








280 










285 


Trp 


Ala 


Val 


Ser 


Tyr 


Tyr 


He 


Arg 


Phe Phe 


He 


Thr 


Tyr 


He 


Pro 










290 








295 










300 


Phe 


Tyr 


Gly 


He 


Leu 


Gly Ala 


Leu 


Leu Phe 


Leu 


Asn 


Phe 


He 


Arg 










305 








310 










315 


Phe 


Leu 


Glu 


Ser 


His 


Trp 


Phe 


Val 


Trp Val 


Thr 


Gin 


Met 


Asn 


His 










320 








325 










330 


He 


Val 


Met 


Glu 


He 


Asp 


Gin 


Glu 


Ala Tyr Arg 


Asp 


Trp 


Phe 


Ser 










335 








340 










345 


Ser 


Gin 


Leu 


Thr 


Ala 


Thr 


Cys 


Asn 


Val Glu 


Gin 


Ser 


Phe 


Phe 


Asn 










350 








355 










360 


Asp 


Trp 


Phe 


Ser 


Gly 


His 


Leu 


Asn 


Phe Gin 


He 


Glu 


His 


His 


Leu 










365 








370 










375 


Phe 


Pro 


Thr 


Met 


Pro 


Arg 


His 


Asn 


Leu His 


Lys 


He 


Ala 


Pro 


Leu 










380 








385 










390 


Val 


Lys 


Ser 


Leu 


Cys 


Ala 


Lys 


His 


Gly He 


Glu 


Tyr 


Gin 


Glu 


Lys 










400 








405 










410 


Pro 


Leu 


Leu 


Arg 


Ala 


Leu 


Leu Asp 


He He 


Arg 


Ser 


Leu 


Lys 


Lys 










415 








420 










425 


Ser 


Gly 


Lys 


Leu 


Trp 


Leu Asp Ala 


Tyr Leu 


His 


Lys 


**★ 


Ser 


His 










430 








435 










440 


Ser 


Pro 


Arg 


Asp 


Thr 


Val 


Gly Lys 


Gly Cys 


Arg 


Trp 


Gly 


Asp 


Gly 










445 








450 










455 


Gin 


Arg 


Asn 


Asp 


Gly 


Leu 


Leu 


Phe 


*** Gly Val 


Ser 


Glu 


Arg 


Leu 










460 








465 










470 


Val 


Tyr 


Ala 


Leu 


Leu 


Thr Asp 


Pro 


Met Leu Asp 


Leu 


Ser 


Pro 


Phe 










475 








480 










485 


Leu 


Leu 


Ser 


Phe 


Phe 


Ser 


Ser 


His 


Leu Pro 


His 


Ser 


Thr 


Leu 


Pro 










490 








495 










500 


Ser 


Trp 


Asp 


Leu 


Pro 


Ser 


Leu 


Ser 


Arg Gin 


Pro 


Ser 


Ala 


Met 


Ala 










505 








510 










515 


Leu 


Pro 


Val 


Pro 


Pro 


Ser 


Pro 


Phe 


Phe Gin 


Gly Ala 


Glu 


Arg 


Trp 










520 








525 










530 


Pro 


Pro 


Gly Val 


Ala 


Leu 


Ser 


Tyr 


Leu His 


Ser 


Leu 


Pro 


Leu 


Lys 










535 








540 










545 


Met 


Gly Gly Asp 


Gin 


Arg 


Ser 


Met 


Gly Leu 


Ala 


Cys 


Glu 


Ser 


Pro 










550 








555 










560 


Leu 


Ala 


Ala 


Trp 


Ser 


Leu 


Gly 


lie 


Thr Pro 


Ala 


Leu 


Val 


Leu 


Gin 










565 








570 










575 


Met 


Leu 


Leu 


Gly 


Phe 


He 


Gly Ala Gly Pro 


Ser 


Arg 


Ala 


Gly 


Pro 










580 








585 










590 


Leu 


Thr 


Leu 


Pro 


Ala 


Trp 


Leu 


His 


Ser Pro 


* * * 


Arg 


Leu 


Pro 


Leu 










595 








600 










605 


Val 


His 


Pro 


Phe 


He 


Glu 


Arg 


Pro 


Ala Leu 


Leu 


Gin 


Ser 


Ser 


Gly 










610 








615 










620 


Leu 


Pro 


Pro 


Ala 


Ala 


Arg 


Leu 


Ser 


Thr Arg 


Gly 


Leu 


Ser 


•* * * 


Asp 










625 








630 










635 


Val 


Gin 


Gly 


Pro 


Arg 


Pro 


Ala 


Gly 


Thr Ala 


Ser 


Pro 


Asn 


Leu 


Gly 










640 








645 










650 


Pro 


Trp 


Lys 


Ser 


Pro 


Pro 


Pro 


His 


His *** 


Ser 


Ala 


Leu 


Thr 


Leu 










655 








660 










665 


Gly 


Phe 


His 


Gly 


Pro 


His 


Ser 


Thr 


Ala Ser 


Pro 


Thr 


*★* 


Ala 


Cys 










670 








675 










680 


Asp 


Leu 


Gly Thr 


Lys 


Gly 


Gly Val 


Pro Arg 


Leu 


Leu 


★ * * 


Leu 


Ser 
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685 


690 






695 


Arg Gly Ser 


Gly His Val 


Gin Gly Gly Ala 


Gly 


Trp 


Pro Gly Gly 






700 


705 






710 


Ser Ala 


His 


Pro Pro Ala 


Phe Pro Gin Gly 


Val 


Leu 


Arg Ser Lys 


lie Leu 




715 


720 






725 


Glu 


Gin Ser Asp 


Pro Ser Pro Lys 


Ala 


Leu 


Leu Ser Ala 






730 


735 






740 


Gly Gin 


Cys 


Gin Pro lie 


Pro Gly His Leu 


Ala 


Pro 


Gly Asp Val 






745 


750 






755 


Gly Pro 


Xxx 











(2) INFORMATION FOR SEQ ID NO: 45: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 6 nucleic acids 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 



20 



30 



50 



55 



60 



65 



(ii) MOLECULE TYPE: nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 



25 CGTATGTCAC TCCATTCCAA ACTCGTTCAT GGTATCATAA ATATCAACAC ATTTACGCTC 60 

CACTCCTCTA TGGTATTTAC ACACTCAAAT ATCGTACTCA AGATTGGGAA GCTTTTGTAA 120 

AGGATGGTAA AAATGGTGCA ATTCGTGTTA GTGTCGCCAC AAATTTCGAT AAGGCCGCTT 180 

ACGTCATTGG TAAATTGTCT TTTGTTTTCT TCCGTTTCAT CCTTCCACTC CGTTATCATA 240 

GCTTTACAGA TTTAATTTGT TATTTCCTCA TTGCTGAATT CGTCTTTGGT TGGTATCTCA 300 

CAATTAATTT CCAAGTTAGT CATGTCGCTG AAGATCTCAA ATTCTTTGCT ACCCCTGAAA 360 

GACCAGATGA ACCATCT CAA ATCAATGAAG ATTGGGCAAT CCTTCAACTT AAAACTACTC 420 

AAGATTATGG TCATGGTTCA CTCCTTTGTA CCTTTTTTAG TGGTTCTTTA AATCATCAAG 480 

TTGTTCATCA TTTATTCCCA TCAATTGCTC AAGATTTCTA CCCACAACTT GTACCAATTG 54 0 

TAAAAGAAGT TTGTAAAGAA CATAACATTA CTTACCACAT TAAACCAAAC TTCACTGAAG 600 

15 CTATTATGTC ACACATTAAT TACCTTTACA AAATGGGTAA TGATCCAGAT TATGTTAAAA 660 

AACCATTAGC CTCAAAAGAT GATTAAATGA AATAACTTAA AAACCAATTA TTTACTTTTG 720 

ACAAACAGTA ATATTAATAA ATACAA 746 

40 (2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 227 amino acids 

(B) TYPE: amino acid 

45 (C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:46: 



Thr 


Pro 


Phe 


Gin 


Thr 


Arg 


Ser 


Trp 


Tyr 


His 


Lys 


Tyr 


Gin 


Tyr 


Ala 


5 










10 








15 


Pro 


Leu 


Leu 


Tyr 


Gly 


He 


Tyr 


Thr 


Leu 


Lys 


Tyr 






20 










25 








30 


Gin 


Asp 


Trp 
35 


Glu 


Ala 


Phe 


Val 


Lys 
40 


Asp 


Gly 


Lys 


Asn 


Gly 
45 


Arg 


Val 


Ser 
50 


Val 


Ala 


Thr 


Asn 


Phe 
55 


Asp 


Lys 


Ala 


Ala 


Tyr 
60 


Gly 


Lys 


Leu 


Ser 


Phe 


Val 


Phe 


Phe 


Arg 


Phe 


He 


Leu 


Pro 






65 










70 








75 


Tyr 


His 


Ser 


Phe 


Thr 


Asp 


Leu 


He 


Cys 


Tyr 


Phe 


Leu 


He 


Phe 




80 










85 








90 


Val 


Phe 
95 


Gly 


Trp 


Tyr 


Leu 


Thr 
100 


He 


Asn 


Phe 


Gin 


Val 
105 
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Ser His 


Val Ala 


Glu Asp 


Leu 


Lys 


Phe 


Phe Ala 


Thr 


Pro 


Glu 


Arg 






110 








115 








120 


Pro Asp 


Glu Pro 


Ser Gin 


He 


Asn 


Glu 


Asp Trp 


Ala 


He 


Leu 


Gin 




125 








130 








135 


Leu Lys 


Thr Thr 


Gin Asp Tyr Gly His 


Gly Ser 


Leu 


Leu 


Cys 


Thr 




140 








145 








150 


Phe Phe 


Ser Gly 


Ser Leu 


Asn 


His 


Gin 


Val Val 


His 


His 


Leu 


Phe 




155 








160 








165 


Pro Ser 


He Ala 


Gin Asp 


Phe 


Tyr 


Pro 


Gin Leu 


Val 


Pro 


He 


Val 






170 








175 








180 


Lys Glu 


Val Cys 


Lys Glu 


His 


Asn 


He 


Thr Tyr 


His 


He 


Lys 


Pro 




185 








190 








195 


Asn Phe 


Thr Glu 


Ala He 


Met 


Ser 


His 


He Asn 


Tyr 


Leu 


Tyr 


Lys 






200 








205 








210 


Met Gly Asn Asp 


Pro Asp 


Tyr 


Val 


Lys 


Lys Pro 


Leu 


Ala 


Ser 


Lys 






215 








220 








225 



Asp Asp *** 



20 (2) INFORMATION FOR SEQ ID NO 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 94 nucleic acids 

(B) TYPE: nucleic acid 

25 (C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: nucleic acid 

30 (xi) SEQUENCE DESCRIPTION : SEQ ID NO: 47: 



TTTTGGAAGG NTCCAAGTTN ACCACGGANT NGGCAAGTTN ACGGGGCGGA AANCGGTTTT 60 

CCCCCCAAGC CTTTTGTCGA CTGGTTCTGT GGTGGCTTCC AGTACCAAGT CGACCACCAC 120 

35 TTATTCCCCA GCCTGCCCCG ACACAAT CTG GCCAAGACAC ACGCACTGGT CGAATCGTTC 180 

TGCAAGGAGT GGGGTGTCCA GTACCACGAA GCCGACCTCG TGGACGGGAC CATGGAAGTC 24 0 

TTGCACCATT TGGGCAGCGT GGCCGGCGAA TTCGTCGTGG ATTTTGTACG CGACGGACCC 300 

GCCATGTAAT CGTCGTTCGT GACGATGCAA GGGTTCACGC ACATCTACAC ACACTCACTC 360 

ACACAACTAG TGTAACTCGT ATAGAATTCG GTGTCGACCT GGACCTTGTT TGACTGGTTG 420 

40 GGGATAGGGT AGGTAGGCGG ACGCGTGGGT CGNCCCCGGG AATTCTGTGA CCGGTACCTG 480 

GCCCGCGTNA AAGT 494 



45 (2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 87 amino acids 

(B) TYPE: amino acid 

50 (C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:48: 

Phe Trp Lys Xxx Pro Ser Xxx Pro Arg Xxx Xxx Gin Val Xxx Gly 
15 10 15 

Ala Glu Xxx Gly Phe Pro Pro Lys Pro Phe Val Asp Trp Phe Cys 
60 20 25 30 

Gly Gly Phe Gin Tyr Gin Val Asp His His Leu Phe Pro Ser Leu 

35 40 45 

Pro Arg His Asn Leu Ala Lys Thr His Ala Leu Val Glu Ser Phe 

50 55 60 

65 Cys Lys Glu Trp Gly Val Gin Tyr His Glu Ala Asp Leu Val Asp 

65 70 75 
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Gly Thr Met Glu Val Leu His His Leu Gly Ser Val Ala Gly Glu 
65 70 75 

Phe Val Val Asp Phe Val Arg Asp Gly Pro Ala Met 
80 85 



10 



20 



(2) INFORMATION FOR SEQ ID NO: 49: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 520 nucleic acids 
15 (B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: nucleic acid 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 49: 



GGATGGAGTT CGTCTGGATC GCTGTGCGCT ACGCGACGTG GTTTAAGCGT CATGGGTGCG 60 

CTTGGGTACA CGCCGGGGCA GTCGTTGGGC ATGTACTTGT GCGCCTTTGG TCTCGGCTGC 120 

ATTTACATTT TTCTGCAGTT CGCCGTAAGT CACACCCATT TGCCCGTGAG CAACCCGGAG 180 

GATCAGCTGC ATTGGCTCGA GTACGCGCGG ACCACACTGT GAACATCAGC ACCAAGTCGT 240 

GGTTTGTCAC ATGGTGGATG TCGAACCTCA ACTTTCAGAT CGAGCACCAC CTTTTCCCCA - 300 

CGGCGCCCCA GTTCCGTTTC AAGG AG AT CA GCCCGCGCGT CGAGGCCCTC TTCAAGCGCC 360 

ACGGTCTCCC TTACTACGAC ATGCCCTACA CGAGCGCCGT CTCCACCACC TTTGCCAACC 420 

TCTACTCCGT CGGCCATTCC GTCGGCGACG CCAAGCGCGA CTAGCCTCTT TTCCTAGACC 480 

TTAATTCCCC ACCCCACCCC ATGTTCTGTC TTCCTCCCGC 520 



35 (2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 153 amino acids 

(B) TYPE: amino acid 

40 (C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 





Met 


Glu 


Phe 


Val 


Trp 


He 


50 


1 








5 




Arg 


His 


Gly 


Cys 


Ala 
20 


Trp 




Val 


Leu 


Val 


Arg 


Leu 
35 


Trp 


55 


Val 


Arg 


Arg 


Lys 


Ser 


His 








50 






Ser 


Ala 


Ala 


Leu 


Ala 
65 


Arg 




Ser 


Thr 


Lys 


Ser 


Trp 


Phe 


60 










80 




Phe 


Gin 


lie 


Glu 


His 
95 


His 




Phe 


Lys 


Glu 


He 


Ser 
110 


Pro 


65 


Gly 


Leu 


Pro 


Tyr 


Tyr 
125 


Asp 




Thr 


Phe 


Ala 


Asn 


Leu 


Tyr 



Ala 


Val 


Arg 


Tyr 
10 


Ala 


Thr 


Trp 


Phe 


Lys 
15 


Val 


His 


Ala 


Gly 
25 


Ala 


Val 


Val 


Gly 


His 
30 


Ser 


Arg 


Leu 


His 
40 


Leu 


His 


Phe 


Ser 


Ala 
45 


Pro 


Phe 


Ala 


Arg 


Glu 


Gin 


Pro 


Gly Gly 








55 










60 


Val 


Arg 


Ala 


Asp 
70 


His 


Thr 


Val 


Asn 


He 
75 


Val 


Thr 


Trp 


Trp 
85 


Met 


Ser 


Asn 


Leu 


Asn 
90 


Leu 


Phe 


Pro 


Thr 
100 


Ala 


Pro 


Gin 


Phe 


Arg 
105 


Arg 


Val 


Glu 


Ala 
115 


Leu 


Phe 


Lys 


Arg 


His 
120 


Met 


Pro 


Tyr 


Thr 
130 


Ser 


Ala 


Val 


Ser 


Thr 
135 


Ser 


Val 


Gly 


His 


Ser 


Val 


Gly Asp Ala 
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15 



140 _ 145 150 

Lys Arg Asp 



(2) INFORMATION FOR SEQ ID NO: 51: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 429 nucleic acids 
10 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 



ACGCGTCCGC CCACGCGTCC GCCGCGAGCA ACTCATCAAG GAAGGCTACT TTGACCCCTC 60 

20 GCTCCCGCAC ATGACGTACC GCGTGGTCGA GATTGTTGTT CTCTTCGTGC TTTCCTTTTG 120 

GCTGATGGGT CAGTCTTCAC CCCTCGCGCT CGCTCTCGGC ATTGTCGTCA GCGGCATCTC 180 

TCAGGGTCGC TGCGGCTGGG TAATGCATGA GATGGGCCAT GGGTCGTTCA CTGGTGTCAT 240 

TTGGCTTGAC GACCGGTTGT GCGAGTTCTT TTACGGCGTT GGTTGTGGCA TGAGCGGTCA 300 

TTACTGGAAA AACCAGCACA GCAAACACCA CGCAGCGCCA AACCGGCTCG AGCACGATGT 360 

25 AGATCTCAAC ACCTTGCCAT TGGTGGCCTT CAACGAGCGC GTCGTGCGCA AGGTCCGACC 420 



30 



(2) INFORMATION FOR SEQ ID NO: 52: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 125 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 
35 (D) TOPOLOGY: linear 



40 



45 



50 



55 



60 



(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 



Arg Val 


Arg 


Pro Arg Val 


Arg 


Arg 


Glu 


Gin 


Leu 


He Lys 


Glu 


Gly 


1 






5 








10 








15 


Tyr Phe 


Asp 


Pro 


Ser Leu 


Pro 


His 


Met 


Thr 


Tyr 


Arg Val 


Val 


Glu 








20 








25 








30 


lie Val 


Val 


Leu 


Phe Val 


Leu 


Ser 


Phe 


Trp 


Leu 


Met Gly 


Gin 


Ser 








35 








40 








45 


Ser Pro 


Leu 


Ala 


Leu Ala 


Leu 


Gly 


He 


Val 


Val 


Ser Gly 


He 


Ser 








50 








55 








60 


Gin Gly Arg 


Cys 


Gly Trp Val 


Met 


His 


Glu 


Met 


Gly His 


Gly 


Ser 








65 








70 








75 


Phe Thr 


Gly 


Val 


He Trp 


Leu Asp 


Asp Arg 


Leu 


Cys Glu 


Phe 


Phe 








65 








70 








75 


Tyr Gly Val 


Gly Cys Gly Met 


Ser 


Gly His 


Tyr Trp Lys Asn 


Gin 








80 








85 








90 


His Ser 


Lys 


His 


His Ala 


Ala 


Pro 


Asn 


Arg 


Leu 


Glu His 


Asp 


Val 








95 








100 








105 


Asp Leu 


Asn 


Thr 


Leu Pro 


Leu 


Val 


Ala 


Phe 


Asn 


Glu Arg Val 


Val 








110 








115 








120 


Arg Lys 


Val 


Arg 


Pro 
























125 
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What is claimed is : 

1 . A nucleic acid construct comprising: 

One or more nucleotide sequences depicted in a SEQ ID NO: selected from 
the group consisting of SEQ ID NO: 1, SEQ ID NO:3 and SEQ ID NO:5, 
5 wherein said one or more nucleotide sequences is linked to a heterologous 

nucleotide sequence. 

2. A nucleic acid construct comprising: 

One or more nucleotide sequences depicted in a SEQ ID NO: selected from 
1 0 the group consisting of SEQ ID NO: 1 , SEQ ID NO:3 and SEQ ID NO:5, 

wherein said one or more nucleotide sequences is operably associated with an 
expression control sequence functional in a plant cell. 

3. The nucleic acid construct according to claim 2, wherein said nucleotide 
1 5 sequence has an average A + T content of less than about 60%. 

4. The nucleic acid construct according to claim 2, wherein said nucleotide 
sequence is derived from a fungus. 

20 5. The nucleic acid construct according to claim 4, wherein said fungus is of 

the genus Mortierella 

6. The nucleic acid construct according to claim 5, wherein said fungus is of 
the species alpina. 



25 



7. A nucleic acid construct comprising: 

A nucleotide sequence which encodes a polypeptide comprising an amino 
acid sequence depicted in SEQ ID NO:2, wherein said nucleotide sequence is 
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operably associated with a transcription or an expression control sequence 
function in a plant cell, wherein said nucleotide sequence encodes a functionally 
active polypeptide which desaturates a fatty acid molecule at carbon 6 from the 
carboxyl end of said fatty acid molecule. 

5 

8. A nucleic acid construct comprising: 

A nucleotide sequence which encodes a polypeptide comprising an 
amino acid sequence depicted in SEQ ID NO:4, wherein said nucleotide 
sequence is operably associated with a transcription or an expression control 
10 sequence functional in a plant cell, wherein said nucleotide sequence encodes a 

functionally active polypeptide which desaturates a fatty acid molecule at 
carbon 12 from the carboxyl end of said fatty acid molecule. 



9. A nucleic acid construct comprising: 

15 A nucleotide sequence which encodes a polypeptide comprising an 

amino acid sequence depicted in SEQ ID NO:6, wherein said nucleotide 
sequence is operably associated with a transcription or an expression control 
sequence function in a plant cell, wherein said nucleotide sequence encodes a 
functionally active polypeptide which desaturates a fatty acid molecule at 

20 carbon 5 from the carboxyl end of said fatty acid moleculle. 

10. A nucleic acid construct comprising: 

at least one nucleotide sequence which encodes a functionally active 
desaturase having an amino acid sequence depicted in a SEQ ID NO: selected 
25 from the group consisting of SEQ ID NO:2, SEQ ID NO:4 and SEQ ID NO:6, 

wherein said nucleotide sequence is operably associated with a promoter 
functional in a plant cell. 
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11. The nucleic acid construct according to claim 10, wherein said plant cell is a 
seed cell. 

12. The nucleic acid construct according to claim 11, wherein said seed cell is 
5 an embryo cell. 

13. A recombinant plant cell comprising: 

At least one copy of a DN A sequence which encodes at least one 
functionally active Mortierella alpina fatty acid desaturase which results in the 

10 production of a polyunsaturated fatty acid, wherein said fatty acid desaturase 

has an amino acid sequence as depicted in a SEQ ID NO: selected from the 
group consisting of SEQ ID NO:2, SEQ ID NO:4, and SEQ ID NO:6, wherein 
said cell was transformed with a vector comprising said DNA sequence, and 
wherein said DNA sequence is operably associated with an expression control 

15 sequence. 

14. The recombinant plant cell of claim 13, wherein said polyunsaturated fatty 
acid is selected from the group consisting of LA, ARA, GLA, DGLA, SDA 
and EPA. 

20 

15. The recombinant plant cell of claim 13, wherein said recombinant plant cell 
is enriched in a fatty acid selected from the group consisting of 18:1, 18:2, 
18:3 and 18:4. 

25 16. The recombinant plant cell of claim 15, wherein said plant cell is selected 

from the group consisting of Brassica, soybean, safflower, corn, flax, and 
sunflower. 
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17. The recombinant plant cell according to claim 16, wherein said expression 
control sequence is endogenous to said plant cell. 

18. One or more plant oils expressed by said recombinant plant cell of claim 16. 

5 

19. A method for obtaining altered long chain polyunsaturated fatty acid 
biosynthesis comprising the steps of: 

growing a plant having cells which contain a transgene encoding a 
transgene expression product which desaturates a fatty acid molecule at carbon 
10 5 from the carboxyl end of said fatty acid molecule, wherein said transgene is 

operably associated with an expression control sequence, under conditions 
whereby said transgene is expressed, whereby long chain polyunsaturated fatty 
acid biosynthesis in said cells is altered. 

15 20. A method for obtaining altered long chain polyunsaturated fatty acid 

biosynthesis comprising the steps of: 

growing a plant having cells which contain one or more transgenes, 
derived from a fungus or algae, which encodes a transgene expression product 
which desaturates a fatty acid molecule at a carbon selected from the group 
20 consisting of carbon 5, carbon 6 and carbon 12 from the carboxyl end of said 

fatty acid molecule, wherein said one or more transgenes is operably associated 
with an expression control sequence, under conditions whereby said one or 
more transgenes is expressed, whereby long chain polyunsaturated fatty acid 
biosynthesis in said cells is altered. 

25 

21. The method according to claims 19 or 20, wherein said long chain 
polyunsaturated fatty acid is selected from the group consisting of LA, ARA, 
GLA, DGLA, SDA and EPA. 
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22. A plant oil or fraction thereof produced according to the method of claims 
19 or 20. 



23. A method of treating or preventing malnutrition comprising administering 

5 said plant oil of claim 22 to a patient in need of said treatment or prevention 

in an amount sufficient to effect said treatment or prevention. 

24. A pharmaceutical composition comprising said plant oil or fraction of claim 
22 and a pharmaceutically acceptable carrier. 

10 

25. The pharmaceutical composition of claim 24, wherein said pharmaceutical 
composition is in the form of a solid or a liquid. 

26. The pharmaceutical composition of claim 25, wherein said pharmaceutical 
15 composition is in a capsule or tablet form. 



27. The pharmaceutical composition of claim 24 further comprising at least one 
nutrient selected from the group consisting of a vitamin, a mineral, a 
carbohydrate, a sugar, an amino acid, a free fatty acid, a phospholipid, an 
20 antioxidant, and a phenolic compound. 



28. A nutritional formula comprising said plant oil or fraction thereof of claim 
22. 



25 29. The nutritional formula of claim 28, wherein said nutritional formula is 

selected from the group consisting of an infant formula, a dietary 
supplement, and a dietary substitute. 
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30. The nutritional formula of claim 29, wherein said infant formula, dietary 
supplement or dietary supplement is in the form of a liquid or a solid. 

31. An infant formula comprising said plant oil or fraction thereof of claim 22. 

5 

32. The infant formula of claim 31 further comprising at least one macronutrient 
selected from the group consisting of coconut oil, soy oil, canola oil, mono- 
and diglycerides, glucose, edible lactose, electrodialysed whey, 
electrodialysed skim milk, milk whey, soy protein, and other protein 

10 hydrolysates. 

33. The infant formula of claim 32 further comprising at least one vitamin 
selected from the group consisting of Vitamins A, C, D, E, and B complex; 
and at least one mineral selected from the group consisting of calcium, 

15 magnesium, zinc, manganese, sodium, potassium, phosphorus, copper, 

chloride, iodine, selenium, and iron. 

34. A dietary supplement comprising said plant oil or fraction thereof of claim 
22. 

20 

35. The dietary supplement of claim 34 further comprising at least one 
macronutrient selected from the group consisting of coconut oil, soy oil, 
canola oil, mono- and diglycerides, glucose, edible lactose, electrodialysed 
whey, electrodialysed skim milk, milk whey, soy protein, and other protein 

25 hydrolysates. 

36. The dietary supplement of claim 35 further comprising at least one vitamin 
selected from the group consisting of Vitamins A, C, D, E, and B complex; 
and at least one mineral selected from the group consisting of calcium, 
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magnesium, zinc, manganese, sodium, potassium, phosphorus, copper, 
chloride, iodine, selenium, and iron. 

37. The dietary supplement of claim 34 or claim 36, wherein said dietary 
5 supplement is administered to a human or an animal. 

38. A dietary substitute comprising said plant oil or fraction thereof of claim 22. 

39. The dietary substitute of claim 38 further comprising at least one 

10 macronutrient selected from the group consisting of coconut oil, soy oil, 

canola oil, mono- and diglycerides, glucose, edible lactose, electrodialysed 
whey, electrodialysed skim milk, milk whey, soy protein, and other protein 
hydrolysates. 

15 40. The dietary substitute of claim 39 further comprising at least one vitamin 

selected from the group consisting of Vitamins A, C, D, E, and B complex; 
and at least one mineral selected from the group consisting of calcium, 
magnesium, zinc, manganese, sodium, potassium, phosphorus, copper, 
chloride, iodine, selenium, and iron. 

20 

41. The dietary substitute of claim 38 or claim 40, wherein said dietary 
substitute is administered to a human or animal. 

42. A method of treating a patient having a condition caused by insuffient 

25 intake or production of polyunsaturated fatty acids comprising administering 

to said patient said dietary substitute of claim 38 or said dietary supplement 
of claim 34 in an amount sufficient to effect said treatment. 
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43. The method of claim 42, wherein said dietary substitute or said dietary 
supplement is administered enterally or parenterally. 

44. A cosmetic comprising said plant oil or fraction thereof of claim 22. 

5 

45. The cosmetic of claim 44, wherein said cosmetic is applied topically. 

46. The pharmaceutical composition of claim 24, wherein said pharmaceutical 
composition is administered to a human or an animal. 

10 

47. An animal feed comprising said plant oil or fraction thereof of claim 22. 

48. An isolated nucleotide sequence comprising the nucleotide sequence 
selected from the group consisting of SEQ ID NO:38 - SEQ ID NO:44 

15 wherein said nucleotide sequence is expressed in a plant cell. 



49. The method of claim 20 wherein said fungus is Mortierella species. 



50. The method of claim 49 wherein said fungus is Mortierella alpina. 

20 

5 1. An isolated nucleotide sequence selected from the group consisting of SEQ 
ID NO:49 - SEQ ID NO:50 wherein said sequence is expressed in a plant 
cell. 
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